Cargando…

A Practical Comparison of De Novo Genome Assembly Software Tools for Next-Generation Sequencing Technologies

The advent of next-generation sequencing technologies is accompanied with the development of many whole-genome sequence assembly methods and software, especially for de novo fragment assembly. Due to the poor knowledge about the applicability and performance of these software tools, choosing a befit...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Wenyu, Chen, Jiajia, Yang, Yang, Tang, Yifei, Shang, Jing, Shen, Bairong
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3056720/
https://www.ncbi.nlm.nih.gov/pubmed/21423806
http://dx.doi.org/10.1371/journal.pone.0017915
_version_ 1782200227150692352
author Zhang, Wenyu
Chen, Jiajia
Yang, Yang
Tang, Yifei
Shang, Jing
Shen, Bairong
author_facet Zhang, Wenyu
Chen, Jiajia
Yang, Yang
Tang, Yifei
Shang, Jing
Shen, Bairong
author_sort Zhang, Wenyu
collection PubMed
description The advent of next-generation sequencing technologies is accompanied with the development of many whole-genome sequence assembly methods and software, especially for de novo fragment assembly. Due to the poor knowledge about the applicability and performance of these software tools, choosing a befitting assembler becomes a tough task. Here, we provide the information of adaptivity for each program, then above all, compare the performance of eight distinct tools against eight groups of simulated datasets from Solexa sequencing platform. Considering the computational time, maximum random access memory (RAM) occupancy, assembly accuracy and integrity, our study indicate that string-based assemblers, overlap-layout-consensus (OLC) assemblers are well-suited for very short reads and longer reads of small genomes respectively. For large datasets of more than hundred millions of short reads, De Bruijn graph-based assemblers would be more appropriate. In terms of software implementation, string-based assemblers are superior to graph-based ones, of which SOAPdenovo is complex for the creation of configuration file. Our comparison study will assist researchers in selecting a well-suited assembler and offer essential information for the improvement of existing assemblers or the developing of novel assemblers.
format Text
id pubmed-3056720
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-30567202011-03-18 A Practical Comparison of De Novo Genome Assembly Software Tools for Next-Generation Sequencing Technologies Zhang, Wenyu Chen, Jiajia Yang, Yang Tang, Yifei Shang, Jing Shen, Bairong PLoS One Research Article The advent of next-generation sequencing technologies is accompanied with the development of many whole-genome sequence assembly methods and software, especially for de novo fragment assembly. Due to the poor knowledge about the applicability and performance of these software tools, choosing a befitting assembler becomes a tough task. Here, we provide the information of adaptivity for each program, then above all, compare the performance of eight distinct tools against eight groups of simulated datasets from Solexa sequencing platform. Considering the computational time, maximum random access memory (RAM) occupancy, assembly accuracy and integrity, our study indicate that string-based assemblers, overlap-layout-consensus (OLC) assemblers are well-suited for very short reads and longer reads of small genomes respectively. For large datasets of more than hundred millions of short reads, De Bruijn graph-based assemblers would be more appropriate. In terms of software implementation, string-based assemblers are superior to graph-based ones, of which SOAPdenovo is complex for the creation of configuration file. Our comparison study will assist researchers in selecting a well-suited assembler and offer essential information for the improvement of existing assemblers or the developing of novel assemblers. Public Library of Science 2011-03-14 /pmc/articles/PMC3056720/ /pubmed/21423806 http://dx.doi.org/10.1371/journal.pone.0017915 Text en Zhang et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Zhang, Wenyu
Chen, Jiajia
Yang, Yang
Tang, Yifei
Shang, Jing
Shen, Bairong
A Practical Comparison of De Novo Genome Assembly Software Tools for Next-Generation Sequencing Technologies
title A Practical Comparison of De Novo Genome Assembly Software Tools for Next-Generation Sequencing Technologies
title_full A Practical Comparison of De Novo Genome Assembly Software Tools for Next-Generation Sequencing Technologies
title_fullStr A Practical Comparison of De Novo Genome Assembly Software Tools for Next-Generation Sequencing Technologies
title_full_unstemmed A Practical Comparison of De Novo Genome Assembly Software Tools for Next-Generation Sequencing Technologies
title_short A Practical Comparison of De Novo Genome Assembly Software Tools for Next-Generation Sequencing Technologies
title_sort practical comparison of de novo genome assembly software tools for next-generation sequencing technologies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3056720/
https://www.ncbi.nlm.nih.gov/pubmed/21423806
http://dx.doi.org/10.1371/journal.pone.0017915
work_keys_str_mv AT zhangwenyu apracticalcomparisonofdenovogenomeassemblysoftwaretoolsfornextgenerationsequencingtechnologies
AT chenjiajia apracticalcomparisonofdenovogenomeassemblysoftwaretoolsfornextgenerationsequencingtechnologies
AT yangyang apracticalcomparisonofdenovogenomeassemblysoftwaretoolsfornextgenerationsequencingtechnologies
AT tangyifei apracticalcomparisonofdenovogenomeassemblysoftwaretoolsfornextgenerationsequencingtechnologies
AT shangjing apracticalcomparisonofdenovogenomeassemblysoftwaretoolsfornextgenerationsequencingtechnologies
AT shenbairong apracticalcomparisonofdenovogenomeassemblysoftwaretoolsfornextgenerationsequencingtechnologies
AT zhangwenyu practicalcomparisonofdenovogenomeassemblysoftwaretoolsfornextgenerationsequencingtechnologies
AT chenjiajia practicalcomparisonofdenovogenomeassemblysoftwaretoolsfornextgenerationsequencingtechnologies
AT yangyang practicalcomparisonofdenovogenomeassemblysoftwaretoolsfornextgenerationsequencingtechnologies
AT tangyifei practicalcomparisonofdenovogenomeassemblysoftwaretoolsfornextgenerationsequencingtechnologies
AT shangjing practicalcomparisonofdenovogenomeassemblysoftwaretoolsfornextgenerationsequencingtechnologies
AT shenbairong practicalcomparisonofdenovogenomeassemblysoftwaretoolsfornextgenerationsequencingtechnologies