Cargando…
Clover: a clustering-oriented de novo assembler for Illumina sequences
BACKGROUND: Next-generation sequencing technologies revolutionized genomics by producing high-throughput reads at low cost, and this progress has prompted the recent development of de novo assemblers. Multiple assembly methods based on de Bruijn graph have been shown to be efficient for Illumina rea...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7672897/ https://www.ncbi.nlm.nih.gov/pubmed/33203354 http://dx.doi.org/10.1186/s12859-020-03788-9 |
_version_ | 1783611227092549632 |
---|---|
author | Hsieh, Ming-Feng Lu, Chin Lung Tang, Chuan Yi |
author_facet | Hsieh, Ming-Feng Lu, Chin Lung Tang, Chuan Yi |
author_sort | Hsieh, Ming-Feng |
collection | PubMed |
description | BACKGROUND: Next-generation sequencing technologies revolutionized genomics by producing high-throughput reads at low cost, and this progress has prompted the recent development of de novo assemblers. Multiple assembly methods based on de Bruijn graph have been shown to be efficient for Illumina reads. However, the sequencing errors generated by the sequencer complicate analysis of de novo assembly and influence the quality of downstream genomic researches. RESULTS: In this paper, we develop a de Bruijn assembler, called Clover (clustering-oriented de novo assembler), that utilizes a novel k-mer clustering approach from the overlap-layout-consensus concept to deal with the sequencing errors generated by the Illumina platform. We further evaluate Clover’s performance against several de Bruijn graph assemblers (ABySS, SOAPdenovo, SPAdes and Velvet), overlap-layout-consensus assemblers (Bambus2, CABOG and MSR-CA) and string graph assembler (SGA) on three datasets (Staphylococcus aureus, Rhodobacter sphaeroides and human chromosome 14). The results show that Clover achieves a superior assembly quality in terms of corrected N50 and E-size while remaining a significantly competitive in run time except SOAPdenovo. In addition, Clover was involved in the sequencing projects of bacterial genomes Acinetobacter baumannii TYTH-1 and Morganella morganii KT. CONCLUSIONS: The marvel clustering-based approach of Clover that integrates the flexibility of the overlap-layout-consensus approach and the efficiency of the de Bruijn graph method has high potential on de novo assembly. Now, Clover is freely available as open source software from https://oz.nthu.edu.tw/~d9562563/src.html. |
format | Online Article Text |
id | pubmed-7672897 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-76728972020-11-19 Clover: a clustering-oriented de novo assembler for Illumina sequences Hsieh, Ming-Feng Lu, Chin Lung Tang, Chuan Yi BMC Bioinformatics Software BACKGROUND: Next-generation sequencing technologies revolutionized genomics by producing high-throughput reads at low cost, and this progress has prompted the recent development of de novo assemblers. Multiple assembly methods based on de Bruijn graph have been shown to be efficient for Illumina reads. However, the sequencing errors generated by the sequencer complicate analysis of de novo assembly and influence the quality of downstream genomic researches. RESULTS: In this paper, we develop a de Bruijn assembler, called Clover (clustering-oriented de novo assembler), that utilizes a novel k-mer clustering approach from the overlap-layout-consensus concept to deal with the sequencing errors generated by the Illumina platform. We further evaluate Clover’s performance against several de Bruijn graph assemblers (ABySS, SOAPdenovo, SPAdes and Velvet), overlap-layout-consensus assemblers (Bambus2, CABOG and MSR-CA) and string graph assembler (SGA) on three datasets (Staphylococcus aureus, Rhodobacter sphaeroides and human chromosome 14). The results show that Clover achieves a superior assembly quality in terms of corrected N50 and E-size while remaining a significantly competitive in run time except SOAPdenovo. In addition, Clover was involved in the sequencing projects of bacterial genomes Acinetobacter baumannii TYTH-1 and Morganella morganii KT. CONCLUSIONS: The marvel clustering-based approach of Clover that integrates the flexibility of the overlap-layout-consensus approach and the efficiency of the de Bruijn graph method has high potential on de novo assembly. Now, Clover is freely available as open source software from https://oz.nthu.edu.tw/~d9562563/src.html. BioMed Central 2020-11-17 /pmc/articles/PMC7672897/ /pubmed/33203354 http://dx.doi.org/10.1186/s12859-020-03788-9 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Software Hsieh, Ming-Feng Lu, Chin Lung Tang, Chuan Yi Clover: a clustering-oriented de novo assembler for Illumina sequences |
title | Clover: a clustering-oriented de novo assembler for Illumina sequences |
title_full | Clover: a clustering-oriented de novo assembler for Illumina sequences |
title_fullStr | Clover: a clustering-oriented de novo assembler for Illumina sequences |
title_full_unstemmed | Clover: a clustering-oriented de novo assembler for Illumina sequences |
title_short | Clover: a clustering-oriented de novo assembler for Illumina sequences |
title_sort | clover: a clustering-oriented de novo assembler for illumina sequences |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7672897/ https://www.ncbi.nlm.nih.gov/pubmed/33203354 http://dx.doi.org/10.1186/s12859-020-03788-9 |
work_keys_str_mv | AT hsiehmingfeng cloveraclusteringorienteddenovoassemblerforilluminasequences AT luchinlung cloveraclusteringorienteddenovoassemblerforilluminasequences AT tangchuanyi cloveraclusteringorienteddenovoassemblerforilluminasequences |