Cargando…

Beyond Taxonomic Identification: Integration of Ecological Responses to a Soil Bacterial 16S rRNA Gene Database

High-throughput sequencing 16S rRNA gene surveys have enabled new insights into the diversity of soil bacteria, and furthered understanding of the ecological drivers of abundances across landscapes. However, current analytical approaches are of limited use in formalizing syntheses of the ecological...

Descripción completa

Detalles Bibliográficos
Autores principales: Jones, Briony, Goodall, Tim, George, Paul B. L., Gweon, Hyun S., Puissant, Jeremy, Read, Daniel S., Emmett, Bridget A., Robinson, David A., Jones, Davey L., Griffiths, Robert I.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8326369/
https://www.ncbi.nlm.nih.gov/pubmed/34349739
http://dx.doi.org/10.3389/fmicb.2021.682886
_version_ 1783731798700720128
author Jones, Briony
Goodall, Tim
George, Paul B. L.
Gweon, Hyun S.
Puissant, Jeremy
Read, Daniel S.
Emmett, Bridget A.
Robinson, David A.
Jones, Davey L.
Griffiths, Robert I.
author_facet Jones, Briony
Goodall, Tim
George, Paul B. L.
Gweon, Hyun S.
Puissant, Jeremy
Read, Daniel S.
Emmett, Bridget A.
Robinson, David A.
Jones, Davey L.
Griffiths, Robert I.
author_sort Jones, Briony
collection PubMed
description High-throughput sequencing 16S rRNA gene surveys have enabled new insights into the diversity of soil bacteria, and furthered understanding of the ecological drivers of abundances across landscapes. However, current analytical approaches are of limited use in formalizing syntheses of the ecological attributes of taxa discovered, because derived taxonomic units are typically unique to individual studies and sequence identification databases only characterize taxonomy. To address this, we used sequences obtained from a large nationwide soil survey (GB Countryside Survey, henceforth CS) to create a comprehensive soil specific 16S reference database, with coupled ecological information derived from survey metadata. Specifically, we modeled taxon responses to soil pH at the OTU level using hierarchical logistic regression (HOF) models, to provide information on both the shape of landscape scale pH-abundance responses, and pH optima (pH at which OTU abundance is maximal). We identify that most of the soil OTUs examined exhibited a non-flat relationship with soil pH. Further, the pH optima could not be generalized by broad taxonomy, highlighting the need for tools and databases synthesizing ecological traits at finer taxonomic resolution. We further demonstrate the utility of the database by testing against geographically dispersed query 16S datasets; evaluating efficacy by quantifying matches, and accuracy in predicting pH responses of query sequences from a separate large soil survey. We found that the CS database provided good coverage of dominant taxa; and that the taxa indicating soil pH in a query dataset corresponded with the pH classifications of top matches in the CS database. Furthermore we were able to predict query dataset community structure, using predicted abundances of dominant taxa based on query soil pH data and the HOF models of matched CS database taxa. The database with associated HOF model outputs is released as an online portal for querying single sequences of interest (https://shiny-apps.ceh.ac.uk/ID-TaxER/), and flat files are made available for use in bioinformatic pipelines. The further development of advanced informatics infrastructures incorporating modeled ecological attributes along with new functional genomic information will likely facilitate large scale exploration and prediction of soil microbial functional biodiversity under current and future environmental change scenarios.
format Online
Article
Text
id pubmed-8326369
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-83263692021-08-03 Beyond Taxonomic Identification: Integration of Ecological Responses to a Soil Bacterial 16S rRNA Gene Database Jones, Briony Goodall, Tim George, Paul B. L. Gweon, Hyun S. Puissant, Jeremy Read, Daniel S. Emmett, Bridget A. Robinson, David A. Jones, Davey L. Griffiths, Robert I. Front Microbiol Microbiology High-throughput sequencing 16S rRNA gene surveys have enabled new insights into the diversity of soil bacteria, and furthered understanding of the ecological drivers of abundances across landscapes. However, current analytical approaches are of limited use in formalizing syntheses of the ecological attributes of taxa discovered, because derived taxonomic units are typically unique to individual studies and sequence identification databases only characterize taxonomy. To address this, we used sequences obtained from a large nationwide soil survey (GB Countryside Survey, henceforth CS) to create a comprehensive soil specific 16S reference database, with coupled ecological information derived from survey metadata. Specifically, we modeled taxon responses to soil pH at the OTU level using hierarchical logistic regression (HOF) models, to provide information on both the shape of landscape scale pH-abundance responses, and pH optima (pH at which OTU abundance is maximal). We identify that most of the soil OTUs examined exhibited a non-flat relationship with soil pH. Further, the pH optima could not be generalized by broad taxonomy, highlighting the need for tools and databases synthesizing ecological traits at finer taxonomic resolution. We further demonstrate the utility of the database by testing against geographically dispersed query 16S datasets; evaluating efficacy by quantifying matches, and accuracy in predicting pH responses of query sequences from a separate large soil survey. We found that the CS database provided good coverage of dominant taxa; and that the taxa indicating soil pH in a query dataset corresponded with the pH classifications of top matches in the CS database. Furthermore we were able to predict query dataset community structure, using predicted abundances of dominant taxa based on query soil pH data and the HOF models of matched CS database taxa. The database with associated HOF model outputs is released as an online portal for querying single sequences of interest (https://shiny-apps.ceh.ac.uk/ID-TaxER/), and flat files are made available for use in bioinformatic pipelines. The further development of advanced informatics infrastructures incorporating modeled ecological attributes along with new functional genomic information will likely facilitate large scale exploration and prediction of soil microbial functional biodiversity under current and future environmental change scenarios. Frontiers Media S.A. 2021-07-19 /pmc/articles/PMC8326369/ /pubmed/34349739 http://dx.doi.org/10.3389/fmicb.2021.682886 Text en Copyright © 2021 Jones, Goodall, George, Gweon, Puissant, Read, Emmett, Robinson, Jones and Griffiths. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Microbiology
Jones, Briony
Goodall, Tim
George, Paul B. L.
Gweon, Hyun S.
Puissant, Jeremy
Read, Daniel S.
Emmett, Bridget A.
Robinson, David A.
Jones, Davey L.
Griffiths, Robert I.
Beyond Taxonomic Identification: Integration of Ecological Responses to a Soil Bacterial 16S rRNA Gene Database
title Beyond Taxonomic Identification: Integration of Ecological Responses to a Soil Bacterial 16S rRNA Gene Database
title_full Beyond Taxonomic Identification: Integration of Ecological Responses to a Soil Bacterial 16S rRNA Gene Database
title_fullStr Beyond Taxonomic Identification: Integration of Ecological Responses to a Soil Bacterial 16S rRNA Gene Database
title_full_unstemmed Beyond Taxonomic Identification: Integration of Ecological Responses to a Soil Bacterial 16S rRNA Gene Database
title_short Beyond Taxonomic Identification: Integration of Ecological Responses to a Soil Bacterial 16S rRNA Gene Database
title_sort beyond taxonomic identification: integration of ecological responses to a soil bacterial 16s rrna gene database
topic Microbiology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8326369/
https://www.ncbi.nlm.nih.gov/pubmed/34349739
http://dx.doi.org/10.3389/fmicb.2021.682886
work_keys_str_mv AT jonesbriony beyondtaxonomicidentificationintegrationofecologicalresponsestoasoilbacterial16srrnagenedatabase
AT goodalltim beyondtaxonomicidentificationintegrationofecologicalresponsestoasoilbacterial16srrnagenedatabase
AT georgepaulbl beyondtaxonomicidentificationintegrationofecologicalresponsestoasoilbacterial16srrnagenedatabase
AT gweonhyuns beyondtaxonomicidentificationintegrationofecologicalresponsestoasoilbacterial16srrnagenedatabase
AT puissantjeremy beyondtaxonomicidentificationintegrationofecologicalresponsestoasoilbacterial16srrnagenedatabase
AT readdaniels beyondtaxonomicidentificationintegrationofecologicalresponsestoasoilbacterial16srrnagenedatabase
AT emmettbridgeta beyondtaxonomicidentificationintegrationofecologicalresponsestoasoilbacterial16srrnagenedatabase
AT robinsondavida beyondtaxonomicidentificationintegrationofecologicalresponsestoasoilbacterial16srrnagenedatabase
AT jonesdaveyl beyondtaxonomicidentificationintegrationofecologicalresponsestoasoilbacterial16srrnagenedatabase
AT griffithsroberti beyondtaxonomicidentificationintegrationofecologicalresponsestoasoilbacterial16srrnagenedatabase