Cargando…

Sequencing effort dictates gene discovery in marine microbial metagenomes

Massive metagenomic sequencing combined with gene prediction methods were previously used to compile the gene catalogue of the ocean and host‐associated microbes. Global expeditions conducted over the past 15 years have sampled the ocean to build a catalogue of genes from pelagic microbes. Here we u...

Descripción completa

Detalles Bibliográficos
Autores principales: Duarte, Carlos M., Ngugi, David K., Alam, Intikhab, Pearman, John, Kamau, Allan, Eguiluz, Victor M., Gojobori, Takashi, Acinas, Silvia G., Gasol, Josep M., Bajic, Vladimir, Irigoien, Xabier
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley & Sons, Inc. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7756799/
https://www.ncbi.nlm.nih.gov/pubmed/32743860
http://dx.doi.org/10.1111/1462-2920.15182
_version_ 1783626620277358592
author Duarte, Carlos M.
Ngugi, David K.
Alam, Intikhab
Pearman, John
Kamau, Allan
Eguiluz, Victor M.
Gojobori, Takashi
Acinas, Silvia G.
Gasol, Josep M.
Bajic, Vladimir
Irigoien, Xabier
author_facet Duarte, Carlos M.
Ngugi, David K.
Alam, Intikhab
Pearman, John
Kamau, Allan
Eguiluz, Victor M.
Gojobori, Takashi
Acinas, Silvia G.
Gasol, Josep M.
Bajic, Vladimir
Irigoien, Xabier
author_sort Duarte, Carlos M.
collection PubMed
description Massive metagenomic sequencing combined with gene prediction methods were previously used to compile the gene catalogue of the ocean and host‐associated microbes. Global expeditions conducted over the past 15 years have sampled the ocean to build a catalogue of genes from pelagic microbes. Here we undertook a large sequencing effort of a perturbed Red Sea plankton community to uncover that the rate of gene discovery increases continuously with sequencing effort, with no indication that the retrieved 2.83 million non‐redundant (complete) genes predicted from the experiment represented a nearly complete inventory of the genes present in the sampled community (i.e., no evidence of saturation). The underlying reason is the Pareto‐like distribution of the abundance of genes in the plankton community, resulting in a very long tail of millions of genes present at remarkably low abundances, which can only be retrieved through massive sequencing. Microbial metagenomic projects retrieve a variable number of unique genes per Tera base‐pair (Tbp), with a median value of 14.7 million unique genes per Tbp sequenced across projects. The increase in the rate of gene discovery in microbial metagenomes with sequencing effort implies that there is ample room for new gene discovery in further ocean and holobiont sequencing studies.
format Online
Article
Text
id pubmed-7756799
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher John Wiley & Sons, Inc.
record_format MEDLINE/PubMed
spelling pubmed-77567992020-12-28 Sequencing effort dictates gene discovery in marine microbial metagenomes Duarte, Carlos M. Ngugi, David K. Alam, Intikhab Pearman, John Kamau, Allan Eguiluz, Victor M. Gojobori, Takashi Acinas, Silvia G. Gasol, Josep M. Bajic, Vladimir Irigoien, Xabier Environ Microbiol Research Articles Massive metagenomic sequencing combined with gene prediction methods were previously used to compile the gene catalogue of the ocean and host‐associated microbes. Global expeditions conducted over the past 15 years have sampled the ocean to build a catalogue of genes from pelagic microbes. Here we undertook a large sequencing effort of a perturbed Red Sea plankton community to uncover that the rate of gene discovery increases continuously with sequencing effort, with no indication that the retrieved 2.83 million non‐redundant (complete) genes predicted from the experiment represented a nearly complete inventory of the genes present in the sampled community (i.e., no evidence of saturation). The underlying reason is the Pareto‐like distribution of the abundance of genes in the plankton community, resulting in a very long tail of millions of genes present at remarkably low abundances, which can only be retrieved through massive sequencing. Microbial metagenomic projects retrieve a variable number of unique genes per Tera base‐pair (Tbp), with a median value of 14.7 million unique genes per Tbp sequenced across projects. The increase in the rate of gene discovery in microbial metagenomes with sequencing effort implies that there is ample room for new gene discovery in further ocean and holobiont sequencing studies. John Wiley & Sons, Inc. 2020-09-29 2020-11 /pmc/articles/PMC7756799/ /pubmed/32743860 http://dx.doi.org/10.1111/1462-2920.15182 Text en © 2020 The Authors. Environmental Microbiology published by Society for Applied Microbiology and John Wiley & Sons Ltd. This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made.
spellingShingle Research Articles
Duarte, Carlos M.
Ngugi, David K.
Alam, Intikhab
Pearman, John
Kamau, Allan
Eguiluz, Victor M.
Gojobori, Takashi
Acinas, Silvia G.
Gasol, Josep M.
Bajic, Vladimir
Irigoien, Xabier
Sequencing effort dictates gene discovery in marine microbial metagenomes
title Sequencing effort dictates gene discovery in marine microbial metagenomes
title_full Sequencing effort dictates gene discovery in marine microbial metagenomes
title_fullStr Sequencing effort dictates gene discovery in marine microbial metagenomes
title_full_unstemmed Sequencing effort dictates gene discovery in marine microbial metagenomes
title_short Sequencing effort dictates gene discovery in marine microbial metagenomes
title_sort sequencing effort dictates gene discovery in marine microbial metagenomes
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7756799/
https://www.ncbi.nlm.nih.gov/pubmed/32743860
http://dx.doi.org/10.1111/1462-2920.15182
work_keys_str_mv AT duartecarlosm sequencingeffortdictatesgenediscoveryinmarinemicrobialmetagenomes
AT ngugidavidk sequencingeffortdictatesgenediscoveryinmarinemicrobialmetagenomes
AT alamintikhab sequencingeffortdictatesgenediscoveryinmarinemicrobialmetagenomes
AT pearmanjohn sequencingeffortdictatesgenediscoveryinmarinemicrobialmetagenomes
AT kamauallan sequencingeffortdictatesgenediscoveryinmarinemicrobialmetagenomes
AT eguiluzvictorm sequencingeffortdictatesgenediscoveryinmarinemicrobialmetagenomes
AT gojoboritakashi sequencingeffortdictatesgenediscoveryinmarinemicrobialmetagenomes
AT acinassilviag sequencingeffortdictatesgenediscoveryinmarinemicrobialmetagenomes
AT gasoljosepm sequencingeffortdictatesgenediscoveryinmarinemicrobialmetagenomes
AT bajicvladimir sequencingeffortdictatesgenediscoveryinmarinemicrobialmetagenomes
AT irigoienxabier sequencingeffortdictatesgenediscoveryinmarinemicrobialmetagenomes