Cargando…

Accurate and sensitive detection of microbial eukaryotes from whole metagenome shotgun sequencing

BACKGROUND: Microbial eukaryotes are found alongside bacteria and archaea in natural microbial systems, including host-associated microbiomes. While microbial eukaryotes are critical to these communities, they are challenging to study with shotgun sequencing techniques and are therefore often exclud...

Descripción completa

Detalles Bibliográficos
Autores principales: Lind, Abigail L., Pollard, Katherine S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931531/
https://www.ncbi.nlm.nih.gov/pubmed/33658077
http://dx.doi.org/10.1186/s40168-021-01015-y
_version_ 1783660313759973376
author Lind, Abigail L.
Pollard, Katherine S.
author_facet Lind, Abigail L.
Pollard, Katherine S.
author_sort Lind, Abigail L.
collection PubMed
description BACKGROUND: Microbial eukaryotes are found alongside bacteria and archaea in natural microbial systems, including host-associated microbiomes. While microbial eukaryotes are critical to these communities, they are challenging to study with shotgun sequencing techniques and are therefore often excluded. RESULTS: Here, we present EukDetect, a bioinformatics method to identify eukaryotes in shotgun metagenomic sequencing data. Our approach uses a database of 521,824 universal marker genes from 241 conserved gene families, which we curated from 3713 fungal, protist, non-vertebrate metazoan, and non-streptophyte archaeplastida genomes and transcriptomes. EukDetect has a broad taxonomic coverage of microbial eukaryotes, performs well on low-abundance and closely related species, and is resilient against bacterial contamination in eukaryotic genomes. Using EukDetect, we describe the spatial distribution of eukaryotes along the human gastrointestinal tract, showing that fungi and protists are present in the lumen and mucosa throughout the large intestine. We discover that there is a succession of eukaryotes that colonize the human gut during the first years of life, mirroring patterns of developmental succession observed in gut bacteria. By comparing DNA and RNA sequencing of paired samples from human stool, we find that many eukaryotes continue active transcription after passage through the gut, though some do not, suggesting they are dormant or nonviable. We analyze metagenomic data from the Baltic Sea and find that eukaryotes differ across locations and salinity gradients. Finally, we observe eukaryotes in Arabidopsis leaf samples, many of which are not identifiable from public protein databases. CONCLUSIONS: EukDetect provides an automated and reliable way to characterize eukaryotes in shotgun sequencing datasets from diverse microbiomes. We demonstrate that it enables discoveries that would be missed or clouded by false positives with standard shotgun sequence analysis. EukDetect will greatly advance our understanding of how microbial eukaryotes contribute to microbiomes. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40168-021-01015-y.
format Online
Article
Text
id pubmed-7931531
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-79315312021-03-05 Accurate and sensitive detection of microbial eukaryotes from whole metagenome shotgun sequencing Lind, Abigail L. Pollard, Katherine S. Microbiome Methodology BACKGROUND: Microbial eukaryotes are found alongside bacteria and archaea in natural microbial systems, including host-associated microbiomes. While microbial eukaryotes are critical to these communities, they are challenging to study with shotgun sequencing techniques and are therefore often excluded. RESULTS: Here, we present EukDetect, a bioinformatics method to identify eukaryotes in shotgun metagenomic sequencing data. Our approach uses a database of 521,824 universal marker genes from 241 conserved gene families, which we curated from 3713 fungal, protist, non-vertebrate metazoan, and non-streptophyte archaeplastida genomes and transcriptomes. EukDetect has a broad taxonomic coverage of microbial eukaryotes, performs well on low-abundance and closely related species, and is resilient against bacterial contamination in eukaryotic genomes. Using EukDetect, we describe the spatial distribution of eukaryotes along the human gastrointestinal tract, showing that fungi and protists are present in the lumen and mucosa throughout the large intestine. We discover that there is a succession of eukaryotes that colonize the human gut during the first years of life, mirroring patterns of developmental succession observed in gut bacteria. By comparing DNA and RNA sequencing of paired samples from human stool, we find that many eukaryotes continue active transcription after passage through the gut, though some do not, suggesting they are dormant or nonviable. We analyze metagenomic data from the Baltic Sea and find that eukaryotes differ across locations and salinity gradients. Finally, we observe eukaryotes in Arabidopsis leaf samples, many of which are not identifiable from public protein databases. CONCLUSIONS: EukDetect provides an automated and reliable way to characterize eukaryotes in shotgun sequencing datasets from diverse microbiomes. We demonstrate that it enables discoveries that would be missed or clouded by false positives with standard shotgun sequence analysis. EukDetect will greatly advance our understanding of how microbial eukaryotes contribute to microbiomes. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40168-021-01015-y. BioMed Central 2021-03-03 /pmc/articles/PMC7931531/ /pubmed/33658077 http://dx.doi.org/10.1186/s40168-021-01015-y Text en © The Author(s) 2021 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Methodology
Lind, Abigail L.
Pollard, Katherine S.
Accurate and sensitive detection of microbial eukaryotes from whole metagenome shotgun sequencing
title Accurate and sensitive detection of microbial eukaryotes from whole metagenome shotgun sequencing
title_full Accurate and sensitive detection of microbial eukaryotes from whole metagenome shotgun sequencing
title_fullStr Accurate and sensitive detection of microbial eukaryotes from whole metagenome shotgun sequencing
title_full_unstemmed Accurate and sensitive detection of microbial eukaryotes from whole metagenome shotgun sequencing
title_short Accurate and sensitive detection of microbial eukaryotes from whole metagenome shotgun sequencing
title_sort accurate and sensitive detection of microbial eukaryotes from whole metagenome shotgun sequencing
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7931531/
https://www.ncbi.nlm.nih.gov/pubmed/33658077
http://dx.doi.org/10.1186/s40168-021-01015-y
work_keys_str_mv AT lindabigaill accurateandsensitivedetectionofmicrobialeukaryotesfromwholemetagenomeshotgunsequencing
AT pollardkatherines accurateandsensitivedetectionofmicrobialeukaryotesfromwholemetagenomeshotgunsequencing