Cargando…

Enhanced Recovery of Microbial Genes and Genomes From a Marine Water Column Using Long-Read Metagenomics

Third-generation sequencing has penetrated little in metagenomics due to the high error rate and dependence for assembly on short-read designed bioinformatics. However, second-generation sequencing metagenomics (mostly Illumina) suffers from limitations, particularly in the assembly of microbes with...

Descripción completa

Detalles Bibliográficos
Autores principales: Haro-Moreno, Jose M., López-Pérez, Mario, Rodriguez-Valera, Francisco
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8430335/
https://www.ncbi.nlm.nih.gov/pubmed/34512586
http://dx.doi.org/10.3389/fmicb.2021.708782
_version_ 1783750684821159936
author Haro-Moreno, Jose M.
López-Pérez, Mario
Rodriguez-Valera, Francisco
author_facet Haro-Moreno, Jose M.
López-Pérez, Mario
Rodriguez-Valera, Francisco
author_sort Haro-Moreno, Jose M.
collection PubMed
description Third-generation sequencing has penetrated little in metagenomics due to the high error rate and dependence for assembly on short-read designed bioinformatics. However, second-generation sequencing metagenomics (mostly Illumina) suffers from limitations, particularly in the assembly of microbes with high microdiversity and retrieval of the flexible (adaptive) fraction of prokaryotic genomes. Here, we have used a third-generation technique to study the metagenome of a well-known marine sample from the mixed epipelagic water column of the winter Mediterranean. We have compared PacBio Sequel II with the classical approach using Illumina Nextseq short reads followed by assembly to study the metagenome. Long reads allow for efficient direct retrieval of complete genes avoiding the bias of the assembly step. Besides, the application of long reads on metagenomic assembly allows for the reconstruction of much more complete metagenome-assembled genomes (MAGs), particularly from microbes with high microdiversity such as Pelagibacterales. The flexible genome of reconstructed MAGs was much more complete containing many adaptive genes (some with biotechnological potential). PacBio Sequel II CCS appears particularly suitable for cellular metagenomics due to its low error rate. For most applications of metagenomics, from community structure analysis to ecosystem functioning, long reads should be applied whenever possible. Specifically, for in silico screening of biotechnologically useful genes, or population genomics, long-read metagenomics appears presently as a very fruitful approach and can be analyzed from raw reads before a computationally demanding (and potentially artifactual) assembly step.
format Online
Article
Text
id pubmed-8430335
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-84303352021-09-11 Enhanced Recovery of Microbial Genes and Genomes From a Marine Water Column Using Long-Read Metagenomics Haro-Moreno, Jose M. López-Pérez, Mario Rodriguez-Valera, Francisco Front Microbiol Microbiology Third-generation sequencing has penetrated little in metagenomics due to the high error rate and dependence for assembly on short-read designed bioinformatics. However, second-generation sequencing metagenomics (mostly Illumina) suffers from limitations, particularly in the assembly of microbes with high microdiversity and retrieval of the flexible (adaptive) fraction of prokaryotic genomes. Here, we have used a third-generation technique to study the metagenome of a well-known marine sample from the mixed epipelagic water column of the winter Mediterranean. We have compared PacBio Sequel II with the classical approach using Illumina Nextseq short reads followed by assembly to study the metagenome. Long reads allow for efficient direct retrieval of complete genes avoiding the bias of the assembly step. Besides, the application of long reads on metagenomic assembly allows for the reconstruction of much more complete metagenome-assembled genomes (MAGs), particularly from microbes with high microdiversity such as Pelagibacterales. The flexible genome of reconstructed MAGs was much more complete containing many adaptive genes (some with biotechnological potential). PacBio Sequel II CCS appears particularly suitable for cellular metagenomics due to its low error rate. For most applications of metagenomics, from community structure analysis to ecosystem functioning, long reads should be applied whenever possible. Specifically, for in silico screening of biotechnologically useful genes, or population genomics, long-read metagenomics appears presently as a very fruitful approach and can be analyzed from raw reads before a computationally demanding (and potentially artifactual) assembly step. Frontiers Media S.A. 2021-08-27 /pmc/articles/PMC8430335/ /pubmed/34512586 http://dx.doi.org/10.3389/fmicb.2021.708782 Text en Copyright © 2021 Haro-Moreno, López-Pérez and Rodriguez-Valera. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Haro-Moreno, Jose M.
López-Pérez, Mario
Rodriguez-Valera, Francisco
Enhanced Recovery of Microbial Genes and Genomes From a Marine Water Column Using Long-Read Metagenomics
title Enhanced Recovery of Microbial Genes and Genomes From a Marine Water Column Using Long-Read Metagenomics
title_full Enhanced Recovery of Microbial Genes and Genomes From a Marine Water Column Using Long-Read Metagenomics
title_fullStr Enhanced Recovery of Microbial Genes and Genomes From a Marine Water Column Using Long-Read Metagenomics
title_full_unstemmed Enhanced Recovery of Microbial Genes and Genomes From a Marine Water Column Using Long-Read Metagenomics
title_short Enhanced Recovery of Microbial Genes and Genomes From a Marine Water Column Using Long-Read Metagenomics
title_sort enhanced recovery of microbial genes and genomes from a marine water column using long-read metagenomics
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8430335/
https://www.ncbi.nlm.nih.gov/pubmed/34512586
http://dx.doi.org/10.3389/fmicb.2021.708782
work_keys_str_mv AT haromorenojosem enhancedrecoveryofmicrobialgenesandgenomesfromamarinewatercolumnusinglongreadmetagenomics
AT lopezperezmario enhancedrecoveryofmicrobialgenesandgenomesfromamarinewatercolumnusinglongreadmetagenomics
AT rodriguezvalerafrancisco enhancedrecoveryofmicrobialgenesandgenomesfromamarinewatercolumnusinglongreadmetagenomics