Cargando…

The impact of sequencing depth on the inferred taxonomic composition and AMR gene content of metagenomic samples

BACKGROUND: Shotgun metagenomics is increasingly used to characterise microbial communities, particularly for the investigation of antimicrobial resistance (AMR) in different animal and environmental contexts. There are many different approaches for inferring the taxonomic composition and AMR gene c...

Descripción completa

Detalles Bibliográficos
Autores principales: Gweon, H. Soon, Shaw, Liam P., Swann, Jeremy, De Maio, Nicola, AbuOun, Manal, Niehus, Rene, Hubbard, Alasdair T. M., Bowes, Mike J., Bailey, Mark J., Peto, Tim E. A., Hoosdally, Sarah J., Walker, A. Sarah, Sebra, Robert P., Crook, Derrick W., Anjum, Muna F., Read, Daniel S., Stoesser, Nicole
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8204541/
https://www.ncbi.nlm.nih.gov/pubmed/33902704
http://dx.doi.org/10.1186/s40793-019-0347-1
_version_ 1783708358580109312
author Gweon, H. Soon
Shaw, Liam P.
Swann, Jeremy
De Maio, Nicola
AbuOun, Manal
Niehus, Rene
Hubbard, Alasdair T. M.
Bowes, Mike J.
Bailey, Mark J.
Peto, Tim E. A.
Hoosdally, Sarah J.
Walker, A. Sarah
Sebra, Robert P.
Crook, Derrick W.
Anjum, Muna F.
Read, Daniel S.
Stoesser, Nicole
author_facet Gweon, H. Soon
Shaw, Liam P.
Swann, Jeremy
De Maio, Nicola
AbuOun, Manal
Niehus, Rene
Hubbard, Alasdair T. M.
Bowes, Mike J.
Bailey, Mark J.
Peto, Tim E. A.
Hoosdally, Sarah J.
Walker, A. Sarah
Sebra, Robert P.
Crook, Derrick W.
Anjum, Muna F.
Read, Daniel S.
Stoesser, Nicole
author_sort Gweon, H. Soon
collection PubMed
description BACKGROUND: Shotgun metagenomics is increasingly used to characterise microbial communities, particularly for the investigation of antimicrobial resistance (AMR) in different animal and environmental contexts. There are many different approaches for inferring the taxonomic composition and AMR gene content of complex community samples from shotgun metagenomic data, but there has been little work establishing the optimum sequencing depth, data processing and analysis methods for these samples. In this study we used shotgun metagenomics and sequencing of cultured isolates from the same samples to address these issues. We sampled three potential environmental AMR gene reservoirs (pig caeca, river sediment, effluent) and sequenced samples with shotgun metagenomics at high depth (~ 200 million reads per sample). Alongside this, we cultured single-colony isolates of Enterobacteriaceae from the same samples and used hybrid sequencing (short- and long-reads) to create high-quality assemblies for comparison to the metagenomic data. To automate data processing, we developed an open-source software pipeline, ‘ResPipe’. RESULTS: Taxonomic profiling was much more stable to sequencing depth than AMR gene content. 1 million reads per sample was sufficient to achieve < 1% dissimilarity to the full taxonomic composition. However, at least 80 million reads per sample were required to recover the full richness of different AMR gene families present in the sample, and additional allelic diversity of AMR genes was still being discovered in effluent at 200 million reads per sample. Normalising the number of reads mapping to AMR genes using gene length and an exogenous spike of Thermus thermophilus DNA substantially changed the estimated gene abundance distributions. While the majority of genomic content from cultured isolates from effluent was recoverable using shotgun metagenomics, this was not the case for pig caeca or river sediment. CONCLUSIONS: Sequencing depth and profiling method can critically affect the profiling of polymicrobial animal and environmental samples with shotgun metagenomics. Both sequencing of cultured isolates and shotgun metagenomics can recover substantial diversity that is not identified using the other methods. Particular consideration is required when inferring AMR gene content or presence by mapping metagenomic reads to a database. ResPipe, the open-source software pipeline we have developed, is freely available (https://gitlab.com/hsgweon/ResPipe).
format Online
Article
Text
id pubmed-8204541
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-82045412021-06-16 The impact of sequencing depth on the inferred taxonomic composition and AMR gene content of metagenomic samples Gweon, H. Soon Shaw, Liam P. Swann, Jeremy De Maio, Nicola AbuOun, Manal Niehus, Rene Hubbard, Alasdair T. M. Bowes, Mike J. Bailey, Mark J. Peto, Tim E. A. Hoosdally, Sarah J. Walker, A. Sarah Sebra, Robert P. Crook, Derrick W. Anjum, Muna F. Read, Daniel S. Stoesser, Nicole Environ Microbiome Research Article BACKGROUND: Shotgun metagenomics is increasingly used to characterise microbial communities, particularly for the investigation of antimicrobial resistance (AMR) in different animal and environmental contexts. There are many different approaches for inferring the taxonomic composition and AMR gene content of complex community samples from shotgun metagenomic data, but there has been little work establishing the optimum sequencing depth, data processing and analysis methods for these samples. In this study we used shotgun metagenomics and sequencing of cultured isolates from the same samples to address these issues. We sampled three potential environmental AMR gene reservoirs (pig caeca, river sediment, effluent) and sequenced samples with shotgun metagenomics at high depth (~ 200 million reads per sample). Alongside this, we cultured single-colony isolates of Enterobacteriaceae from the same samples and used hybrid sequencing (short- and long-reads) to create high-quality assemblies for comparison to the metagenomic data. To automate data processing, we developed an open-source software pipeline, ‘ResPipe’. RESULTS: Taxonomic profiling was much more stable to sequencing depth than AMR gene content. 1 million reads per sample was sufficient to achieve < 1% dissimilarity to the full taxonomic composition. However, at least 80 million reads per sample were required to recover the full richness of different AMR gene families present in the sample, and additional allelic diversity of AMR genes was still being discovered in effluent at 200 million reads per sample. Normalising the number of reads mapping to AMR genes using gene length and an exogenous spike of Thermus thermophilus DNA substantially changed the estimated gene abundance distributions. While the majority of genomic content from cultured isolates from effluent was recoverable using shotgun metagenomics, this was not the case for pig caeca or river sediment. CONCLUSIONS: Sequencing depth and profiling method can critically affect the profiling of polymicrobial animal and environmental samples with shotgun metagenomics. Both sequencing of cultured isolates and shotgun metagenomics can recover substantial diversity that is not identified using the other methods. Particular consideration is required when inferring AMR gene content or presence by mapping metagenomic reads to a database. ResPipe, the open-source software pipeline we have developed, is freely available (https://gitlab.com/hsgweon/ResPipe). BioMed Central 2019-10-24 /pmc/articles/PMC8204541/ /pubmed/33902704 http://dx.doi.org/10.1186/s40793-019-0347-1 Text en © The Author(s). 2019 https://creativecommons.org/licenses/by/4.0/Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) ), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Gweon, H. Soon
Shaw, Liam P.
Swann, Jeremy
De Maio, Nicola
AbuOun, Manal
Niehus, Rene
Hubbard, Alasdair T. M.
Bowes, Mike J.
Bailey, Mark J.
Peto, Tim E. A.
Hoosdally, Sarah J.
Walker, A. Sarah
Sebra, Robert P.
Crook, Derrick W.
Anjum, Muna F.
Read, Daniel S.
Stoesser, Nicole
The impact of sequencing depth on the inferred taxonomic composition and AMR gene content of metagenomic samples
title The impact of sequencing depth on the inferred taxonomic composition and AMR gene content of metagenomic samples
title_full The impact of sequencing depth on the inferred taxonomic composition and AMR gene content of metagenomic samples
title_fullStr The impact of sequencing depth on the inferred taxonomic composition and AMR gene content of metagenomic samples
title_full_unstemmed The impact of sequencing depth on the inferred taxonomic composition and AMR gene content of metagenomic samples
title_short The impact of sequencing depth on the inferred taxonomic composition and AMR gene content of metagenomic samples
title_sort impact of sequencing depth on the inferred taxonomic composition and amr gene content of metagenomic samples
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8204541/
https://www.ncbi.nlm.nih.gov/pubmed/33902704
http://dx.doi.org/10.1186/s40793-019-0347-1
work_keys_str_mv AT gweonhsoon theimpactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT shawliamp theimpactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT swannjeremy theimpactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT demaionicola theimpactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT abuounmanal theimpactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT niehusrene theimpactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT hubbardalasdairtm theimpactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT bowesmikej theimpactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT baileymarkj theimpactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT petotimea theimpactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT hoosdallysarahj theimpactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT walkerasarah theimpactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT sebrarobertp theimpactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT crookderrickw theimpactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT anjummunaf theimpactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT readdaniels theimpactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT stoessernicole theimpactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT theimpactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT gweonhsoon impactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT shawliamp impactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT swannjeremy impactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT demaionicola impactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT abuounmanal impactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT niehusrene impactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT hubbardalasdairtm impactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT bowesmikej impactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT baileymarkj impactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT petotimea impactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT hoosdallysarahj impactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT walkerasarah impactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT sebrarobertp impactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT crookderrickw impactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT anjummunaf impactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT readdaniels impactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT stoessernicole impactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples
AT impactofsequencingdepthontheinferredtaxonomiccompositionandamrgenecontentofmetagenomicsamples