Cargando…

MetaVelvet-DL: a MetaVelvet deep learning extension for de novo metagenome assembly

BACKGROUND: The increasing use of whole metagenome sequencing has spurred the need to improve de novo assemblers to facilitate the discovery of unknown species and the analysis of their genomic functions. MetaVelvet-SL is a short-read de novo metagenome assembler that partitions a multi-species de B...

Descripción completa

Detalles Bibliográficos
Autores principales: Liang, Kuo-ching, Sakakibara, Yasubumi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8171044/
https://www.ncbi.nlm.nih.gov/pubmed/34078257
http://dx.doi.org/10.1186/s12859-020-03737-6
_version_ 1783702355199393792
author Liang, Kuo-ching
Sakakibara, Yasubumi
author_facet Liang, Kuo-ching
Sakakibara, Yasubumi
author_sort Liang, Kuo-ching
collection PubMed
description BACKGROUND: The increasing use of whole metagenome sequencing has spurred the need to improve de novo assemblers to facilitate the discovery of unknown species and the analysis of their genomic functions. MetaVelvet-SL is a short-read de novo metagenome assembler that partitions a multi-species de Bruijn graph into single-species sub-graphs. This study aimed to improve the performance of MetaVelvet-SL by using a deep learning-based model to predict the partition nodes in a multi-species de Bruijn graph. RESULTS: This study showed that the recent advances in deep learning offer the opportunity to better exploit sequence information and differentiate genomes of different species in a metagenomic sample. We developed an extension to MetaVelvet-SL, which we named MetaVelvet-DL, that builds an end-to-end architecture using Convolutional Neural Network and Long Short-Term Memory units. The deep learning model in MetaVelvet-DL can more accurately predict how to partition a de Bruijn graph than the Support Vector Machine-based model in MetaVelvet-SL can. Assembly of the Critical Assessment of Metagenome Interpretation (CAMI) dataset showed that after removing chimeric assemblies, MetaVelvet-DL produced longer single-species contigs, with less misassembled contigs than MetaVelvet-SL did. CONCLUSIONS: MetaVelvet-DL provides more accurate de novo assemblies of whole metagenome data. The authors believe that this improvement can help in furthering the understanding of microbiomes by providing a more accurate description of the metagenomic samples under analysis.
format Online
Article
Text
id pubmed-8171044
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-81710442021-06-03 MetaVelvet-DL: a MetaVelvet deep learning extension for de novo metagenome assembly Liang, Kuo-ching Sakakibara, Yasubumi BMC Bioinformatics Research BACKGROUND: The increasing use of whole metagenome sequencing has spurred the need to improve de novo assemblers to facilitate the discovery of unknown species and the analysis of their genomic functions. MetaVelvet-SL is a short-read de novo metagenome assembler that partitions a multi-species de Bruijn graph into single-species sub-graphs. This study aimed to improve the performance of MetaVelvet-SL by using a deep learning-based model to predict the partition nodes in a multi-species de Bruijn graph. RESULTS: This study showed that the recent advances in deep learning offer the opportunity to better exploit sequence information and differentiate genomes of different species in a metagenomic sample. We developed an extension to MetaVelvet-SL, which we named MetaVelvet-DL, that builds an end-to-end architecture using Convolutional Neural Network and Long Short-Term Memory units. The deep learning model in MetaVelvet-DL can more accurately predict how to partition a de Bruijn graph than the Support Vector Machine-based model in MetaVelvet-SL can. Assembly of the Critical Assessment of Metagenome Interpretation (CAMI) dataset showed that after removing chimeric assemblies, MetaVelvet-DL produced longer single-species contigs, with less misassembled contigs than MetaVelvet-SL did. CONCLUSIONS: MetaVelvet-DL provides more accurate de novo assemblies of whole metagenome data. The authors believe that this improvement can help in furthering the understanding of microbiomes by providing a more accurate description of the metagenomic samples under analysis. BioMed Central 2021-06-02 /pmc/articles/PMC8171044/ /pubmed/34078257 http://dx.doi.org/10.1186/s12859-020-03737-6 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Liang, Kuo-ching
Sakakibara, Yasubumi
MetaVelvet-DL: a MetaVelvet deep learning extension for de novo metagenome assembly
title MetaVelvet-DL: a MetaVelvet deep learning extension for de novo metagenome assembly
title_full MetaVelvet-DL: a MetaVelvet deep learning extension for de novo metagenome assembly
title_fullStr MetaVelvet-DL: a MetaVelvet deep learning extension for de novo metagenome assembly
title_full_unstemmed MetaVelvet-DL: a MetaVelvet deep learning extension for de novo metagenome assembly
title_short MetaVelvet-DL: a MetaVelvet deep learning extension for de novo metagenome assembly
title_sort metavelvet-dl: a metavelvet deep learning extension for de novo metagenome assembly
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8171044/
https://www.ncbi.nlm.nih.gov/pubmed/34078257
http://dx.doi.org/10.1186/s12859-020-03737-6
work_keys_str_mv AT liangkuoching metavelvetdlametavelvetdeeplearningextensionfordenovometagenomeassembly
AT sakakibarayasubumi metavelvetdlametavelvetdeeplearningextensionfordenovometagenomeassembly