Cargando…
MetaVelvet-DL: a MetaVelvet deep learning extension for de novo metagenome assembly
BACKGROUND: The increasing use of whole metagenome sequencing has spurred the need to improve de novo assemblers to facilitate the discovery of unknown species and the analysis of their genomic functions. MetaVelvet-SL is a short-read de novo metagenome assembler that partitions a multi-species de B...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8171044/ https://www.ncbi.nlm.nih.gov/pubmed/34078257 http://dx.doi.org/10.1186/s12859-020-03737-6 |
_version_ | 1783702355199393792 |
---|---|
author | Liang, Kuo-ching Sakakibara, Yasubumi |
author_facet | Liang, Kuo-ching Sakakibara, Yasubumi |
author_sort | Liang, Kuo-ching |
collection | PubMed |
description | BACKGROUND: The increasing use of whole metagenome sequencing has spurred the need to improve de novo assemblers to facilitate the discovery of unknown species and the analysis of their genomic functions. MetaVelvet-SL is a short-read de novo metagenome assembler that partitions a multi-species de Bruijn graph into single-species sub-graphs. This study aimed to improve the performance of MetaVelvet-SL by using a deep learning-based model to predict the partition nodes in a multi-species de Bruijn graph. RESULTS: This study showed that the recent advances in deep learning offer the opportunity to better exploit sequence information and differentiate genomes of different species in a metagenomic sample. We developed an extension to MetaVelvet-SL, which we named MetaVelvet-DL, that builds an end-to-end architecture using Convolutional Neural Network and Long Short-Term Memory units. The deep learning model in MetaVelvet-DL can more accurately predict how to partition a de Bruijn graph than the Support Vector Machine-based model in MetaVelvet-SL can. Assembly of the Critical Assessment of Metagenome Interpretation (CAMI) dataset showed that after removing chimeric assemblies, MetaVelvet-DL produced longer single-species contigs, with less misassembled contigs than MetaVelvet-SL did. CONCLUSIONS: MetaVelvet-DL provides more accurate de novo assemblies of whole metagenome data. The authors believe that this improvement can help in furthering the understanding of microbiomes by providing a more accurate description of the metagenomic samples under analysis. |
format | Online Article Text |
id | pubmed-8171044 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-81710442021-06-03 MetaVelvet-DL: a MetaVelvet deep learning extension for de novo metagenome assembly Liang, Kuo-ching Sakakibara, Yasubumi BMC Bioinformatics Research BACKGROUND: The increasing use of whole metagenome sequencing has spurred the need to improve de novo assemblers to facilitate the discovery of unknown species and the analysis of their genomic functions. MetaVelvet-SL is a short-read de novo metagenome assembler that partitions a multi-species de Bruijn graph into single-species sub-graphs. This study aimed to improve the performance of MetaVelvet-SL by using a deep learning-based model to predict the partition nodes in a multi-species de Bruijn graph. RESULTS: This study showed that the recent advances in deep learning offer the opportunity to better exploit sequence information and differentiate genomes of different species in a metagenomic sample. We developed an extension to MetaVelvet-SL, which we named MetaVelvet-DL, that builds an end-to-end architecture using Convolutional Neural Network and Long Short-Term Memory units. The deep learning model in MetaVelvet-DL can more accurately predict how to partition a de Bruijn graph than the Support Vector Machine-based model in MetaVelvet-SL can. Assembly of the Critical Assessment of Metagenome Interpretation (CAMI) dataset showed that after removing chimeric assemblies, MetaVelvet-DL produced longer single-species contigs, with less misassembled contigs than MetaVelvet-SL did. CONCLUSIONS: MetaVelvet-DL provides more accurate de novo assemblies of whole metagenome data. The authors believe that this improvement can help in furthering the understanding of microbiomes by providing a more accurate description of the metagenomic samples under analysis. BioMed Central 2021-06-02 /pmc/articles/PMC8171044/ /pubmed/34078257 http://dx.doi.org/10.1186/s12859-020-03737-6 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Liang, Kuo-ching Sakakibara, Yasubumi MetaVelvet-DL: a MetaVelvet deep learning extension for de novo metagenome assembly |
title | MetaVelvet-DL: a MetaVelvet deep learning extension for de novo metagenome assembly |
title_full | MetaVelvet-DL: a MetaVelvet deep learning extension for de novo metagenome assembly |
title_fullStr | MetaVelvet-DL: a MetaVelvet deep learning extension for de novo metagenome assembly |
title_full_unstemmed | MetaVelvet-DL: a MetaVelvet deep learning extension for de novo metagenome assembly |
title_short | MetaVelvet-DL: a MetaVelvet deep learning extension for de novo metagenome assembly |
title_sort | metavelvet-dl: a metavelvet deep learning extension for de novo metagenome assembly |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8171044/ https://www.ncbi.nlm.nih.gov/pubmed/34078257 http://dx.doi.org/10.1186/s12859-020-03737-6 |
work_keys_str_mv | AT liangkuoching metavelvetdlametavelvetdeeplearningextensionfordenovometagenomeassembly AT sakakibarayasubumi metavelvetdlametavelvetdeeplearningextensionfordenovometagenomeassembly |