Cargando…

Synthetic Sequencing Standards: A Guide to Database Choice for Rumen Microbiota Amplicon Sequencing Analysis

Our understanding of complex microbial communities, such as those residing in the rumen, has drastically advanced through the use of high throughput sequencing (HTS) technologies. Indeed, with the use of barcoded amplicon sequencing, it is now cost effective and computationally feasible to identify...

Descripción completa

Detalles Bibliográficos
Autores principales: Smith, Paul E., Waters, Sinead M., Gómez Expósito, Ruth, Smidt, Hauke, Carberry, Ciara A., McCabe, Matthew S.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7752867/
https://www.ncbi.nlm.nih.gov/pubmed/33363527
http://dx.doi.org/10.3389/fmicb.2020.606825
_version_ 1783625950654627840
author Smith, Paul E.
Waters, Sinead M.
Gómez Expósito, Ruth
Smidt, Hauke
Carberry, Ciara A.
McCabe, Matthew S.
author_facet Smith, Paul E.
Waters, Sinead M.
Gómez Expósito, Ruth
Smidt, Hauke
Carberry, Ciara A.
McCabe, Matthew S.
author_sort Smith, Paul E.
collection PubMed
description Our understanding of complex microbial communities, such as those residing in the rumen, has drastically advanced through the use of high throughput sequencing (HTS) technologies. Indeed, with the use of barcoded amplicon sequencing, it is now cost effective and computationally feasible to identify individual rumen microbial genera associated with ruminant livestock nutrition, genetics, performance and greenhouse gas production. However, across all disciplines of microbial ecology, there is currently little reporting of the use of internal controls for validating HTS results. Furthermore, there is little consensus of the most appropriate reference database for analyzing rumen microbiota amplicon sequencing data. Therefore, in this study, a synthetic rumen-specific sequencing standard was used to assess the effects of database choice on results obtained from rumen microbial amplicon sequencing. Four DADA2 reference training sets (RDP, SILVA, GTDB, and RefSeq + RDP) were compared to assess their ability to correctly classify sequences included in the rumen-specific sequencing standard. In addition, two thresholds of phylogenetic bootstrapping, 50 and 80, were applied to investigate the effect of increasing stringency. Sequence classification differences were apparent amongst the databases. For example the classification of Clostridium differed between all databases, thus highlighting the need for a consistent approach to nomenclature amongst different reference databases. It is hoped the effect of database on taxonomic classification observed in this study, will encourage research groups across various microbial disciplines to develop and routinely use their own microbiome-specific reference standard to validate analysis pipelines and database choice.
format Online
Article
Text
id pubmed-7752867
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-77528672020-12-23 Synthetic Sequencing Standards: A Guide to Database Choice for Rumen Microbiota Amplicon Sequencing Analysis Smith, Paul E. Waters, Sinead M. Gómez Expósito, Ruth Smidt, Hauke Carberry, Ciara A. McCabe, Matthew S. Front Microbiol Microbiology Our understanding of complex microbial communities, such as those residing in the rumen, has drastically advanced through the use of high throughput sequencing (HTS) technologies. Indeed, with the use of barcoded amplicon sequencing, it is now cost effective and computationally feasible to identify individual rumen microbial genera associated with ruminant livestock nutrition, genetics, performance and greenhouse gas production. However, across all disciplines of microbial ecology, there is currently little reporting of the use of internal controls for validating HTS results. Furthermore, there is little consensus of the most appropriate reference database for analyzing rumen microbiota amplicon sequencing data. Therefore, in this study, a synthetic rumen-specific sequencing standard was used to assess the effects of database choice on results obtained from rumen microbial amplicon sequencing. Four DADA2 reference training sets (RDP, SILVA, GTDB, and RefSeq + RDP) were compared to assess their ability to correctly classify sequences included in the rumen-specific sequencing standard. In addition, two thresholds of phylogenetic bootstrapping, 50 and 80, were applied to investigate the effect of increasing stringency. Sequence classification differences were apparent amongst the databases. For example the classification of Clostridium differed between all databases, thus highlighting the need for a consistent approach to nomenclature amongst different reference databases. It is hoped the effect of database on taxonomic classification observed in this study, will encourage research groups across various microbial disciplines to develop and routinely use their own microbiome-specific reference standard to validate analysis pipelines and database choice. Frontiers Media S.A. 2020-12-08 /pmc/articles/PMC7752867/ /pubmed/33363527 http://dx.doi.org/10.3389/fmicb.2020.606825 Text en Copyright © 2020 Smith, Waters, Gómez Expósito, Smidt, Carberry and McCabe. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Smith, Paul E.
Waters, Sinead M.
Gómez Expósito, Ruth
Smidt, Hauke
Carberry, Ciara A.
McCabe, Matthew S.
Synthetic Sequencing Standards: A Guide to Database Choice for Rumen Microbiota Amplicon Sequencing Analysis
title Synthetic Sequencing Standards: A Guide to Database Choice for Rumen Microbiota Amplicon Sequencing Analysis
title_full Synthetic Sequencing Standards: A Guide to Database Choice for Rumen Microbiota Amplicon Sequencing Analysis
title_fullStr Synthetic Sequencing Standards: A Guide to Database Choice for Rumen Microbiota Amplicon Sequencing Analysis
title_full_unstemmed Synthetic Sequencing Standards: A Guide to Database Choice for Rumen Microbiota Amplicon Sequencing Analysis
title_short Synthetic Sequencing Standards: A Guide to Database Choice for Rumen Microbiota Amplicon Sequencing Analysis
title_sort synthetic sequencing standards: a guide to database choice for rumen microbiota amplicon sequencing analysis
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7752867/
https://www.ncbi.nlm.nih.gov/pubmed/33363527
http://dx.doi.org/10.3389/fmicb.2020.606825
work_keys_str_mv AT smithpaule syntheticsequencingstandardsaguidetodatabasechoiceforrumenmicrobiotaampliconsequencinganalysis
AT waterssineadm syntheticsequencingstandardsaguidetodatabasechoiceforrumenmicrobiotaampliconsequencinganalysis
AT gomezexpositoruth syntheticsequencingstandardsaguidetodatabasechoiceforrumenmicrobiotaampliconsequencinganalysis
AT smidthauke syntheticsequencingstandardsaguidetodatabasechoiceforrumenmicrobiotaampliconsequencinganalysis
AT carberryciaraa syntheticsequencingstandardsaguidetodatabasechoiceforrumenmicrobiotaampliconsequencinganalysis
AT mccabematthews syntheticsequencingstandardsaguidetodatabasechoiceforrumenmicrobiotaampliconsequencinganalysis