Cargando…

MetaPlatanus: a metagenome assembler that combines long-range sequence links and species-specific features

De novo metagenome assembly is effective in assembling multiple draft genomes, including those of uncultured organisms. However, heterogeneity in the metagenome hinders assembly and introduces interspecies misassembly deleterious for downstream analysis. For this purpose, we developed a hybrid metag...

Descripción completa

Detalles Bibliográficos
Autores principales: Kajitani, Rei, Noguchi, Hideki, Gotoh, Yasuhiro, Ogura, Yoshitoshi, Yoshimura, Dai, Okuno, Miki, Toyoda, Atsushi, Kuwahara, Tomomi, Hayashi, Tetsuya, Itoh, Takehiko
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8682757/
https://www.ncbi.nlm.nih.gov/pubmed/34570223
http://dx.doi.org/10.1093/nar/gkab831
_version_ 1784617291087872000
author Kajitani, Rei
Noguchi, Hideki
Gotoh, Yasuhiro
Ogura, Yoshitoshi
Yoshimura, Dai
Okuno, Miki
Toyoda, Atsushi
Kuwahara, Tomomi
Hayashi, Tetsuya
Itoh, Takehiko
author_facet Kajitani, Rei
Noguchi, Hideki
Gotoh, Yasuhiro
Ogura, Yoshitoshi
Yoshimura, Dai
Okuno, Miki
Toyoda, Atsushi
Kuwahara, Tomomi
Hayashi, Tetsuya
Itoh, Takehiko
author_sort Kajitani, Rei
collection PubMed
description De novo metagenome assembly is effective in assembling multiple draft genomes, including those of uncultured organisms. However, heterogeneity in the metagenome hinders assembly and introduces interspecies misassembly deleterious for downstream analysis. For this purpose, we developed a hybrid metagenome assembler, MetaPlatanus. First, as a characteristic function, it assembles the basic contigs from accurate short reads and then iteratively utilizes long-range sequence links, species-specific sequence compositions, and coverage depth. The binning information was also used to improve contiguity. Benchmarking using mock datasets consisting of known bacteria with long reads or mate pairs revealed the high contiguity MetaPlatanus with a few interspecies misassemblies. For published human gut data with nanopore reads from potable sequencers, MetaPlatanus assembled many biologically important elements, such as coding genes, gene clusters, viral sequences, and over-half bacterial genomes. In the benchmark with published human saliva data with high-throughput nanopore reads, the superiority of MetaPlatanus was considerably more evident. We found that some high-abundance bacterial genomes were assembled only by MetaPlatanus as near-complete. Furthermore, MetaPlatanus can circumvent the limitations of highly fragmented assemblies and frequent interspecies misassembles obtained by the other tools. Overall, the study demonstrates that MetaPlatanus could be an effective approach for exploring large-scale structures in metagenomes.
format Online
Article
Text
id pubmed-8682757
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-86827572021-12-20 MetaPlatanus: a metagenome assembler that combines long-range sequence links and species-specific features Kajitani, Rei Noguchi, Hideki Gotoh, Yasuhiro Ogura, Yoshitoshi Yoshimura, Dai Okuno, Miki Toyoda, Atsushi Kuwahara, Tomomi Hayashi, Tetsuya Itoh, Takehiko Nucleic Acids Res Methods Online De novo metagenome assembly is effective in assembling multiple draft genomes, including those of uncultured organisms. However, heterogeneity in the metagenome hinders assembly and introduces interspecies misassembly deleterious for downstream analysis. For this purpose, we developed a hybrid metagenome assembler, MetaPlatanus. First, as a characteristic function, it assembles the basic contigs from accurate short reads and then iteratively utilizes long-range sequence links, species-specific sequence compositions, and coverage depth. The binning information was also used to improve contiguity. Benchmarking using mock datasets consisting of known bacteria with long reads or mate pairs revealed the high contiguity MetaPlatanus with a few interspecies misassemblies. For published human gut data with nanopore reads from potable sequencers, MetaPlatanus assembled many biologically important elements, such as coding genes, gene clusters, viral sequences, and over-half bacterial genomes. In the benchmark with published human saliva data with high-throughput nanopore reads, the superiority of MetaPlatanus was considerably more evident. We found that some high-abundance bacterial genomes were assembled only by MetaPlatanus as near-complete. Furthermore, MetaPlatanus can circumvent the limitations of highly fragmented assemblies and frequent interspecies misassembles obtained by the other tools. Overall, the study demonstrates that MetaPlatanus could be an effective approach for exploring large-scale structures in metagenomes. Oxford University Press 2021-09-27 /pmc/articles/PMC8682757/ /pubmed/34570223 http://dx.doi.org/10.1093/nar/gkab831 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Kajitani, Rei
Noguchi, Hideki
Gotoh, Yasuhiro
Ogura, Yoshitoshi
Yoshimura, Dai
Okuno, Miki
Toyoda, Atsushi
Kuwahara, Tomomi
Hayashi, Tetsuya
Itoh, Takehiko
MetaPlatanus: a metagenome assembler that combines long-range sequence links and species-specific features
title MetaPlatanus: a metagenome assembler that combines long-range sequence links and species-specific features
title_full MetaPlatanus: a metagenome assembler that combines long-range sequence links and species-specific features
title_fullStr MetaPlatanus: a metagenome assembler that combines long-range sequence links and species-specific features
title_full_unstemmed MetaPlatanus: a metagenome assembler that combines long-range sequence links and species-specific features
title_short MetaPlatanus: a metagenome assembler that combines long-range sequence links and species-specific features
title_sort metaplatanus: a metagenome assembler that combines long-range sequence links and species-specific features
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8682757/
https://www.ncbi.nlm.nih.gov/pubmed/34570223
http://dx.doi.org/10.1093/nar/gkab831
work_keys_str_mv AT kajitanirei metaplatanusametagenomeassemblerthatcombineslongrangesequencelinksandspeciesspecificfeatures
AT noguchihideki metaplatanusametagenomeassemblerthatcombineslongrangesequencelinksandspeciesspecificfeatures
AT gotohyasuhiro metaplatanusametagenomeassemblerthatcombineslongrangesequencelinksandspeciesspecificfeatures
AT ogurayoshitoshi metaplatanusametagenomeassemblerthatcombineslongrangesequencelinksandspeciesspecificfeatures
AT yoshimuradai metaplatanusametagenomeassemblerthatcombineslongrangesequencelinksandspeciesspecificfeatures
AT okunomiki metaplatanusametagenomeassemblerthatcombineslongrangesequencelinksandspeciesspecificfeatures
AT toyodaatsushi metaplatanusametagenomeassemblerthatcombineslongrangesequencelinksandspeciesspecificfeatures
AT kuwaharatomomi metaplatanusametagenomeassemblerthatcombineslongrangesequencelinksandspeciesspecificfeatures
AT hayashitetsuya metaplatanusametagenomeassemblerthatcombineslongrangesequencelinksandspeciesspecificfeatures
AT itohtakehiko metaplatanusametagenomeassemblerthatcombineslongrangesequencelinksandspeciesspecificfeatures