Cargando…

specificity: an R package for analysis of feature specificity to environmental and higher dimensional variables, applied to microbiome species data

BACKGROUND: Understanding the factors that influence microbes’ environmental distributions is important for determining drivers of microbial community composition. These include environmental variables like temperature and pH, and higher-dimensional variables like geographic distance and host specie...

Descripción completa

Detalles Bibliográficos
Autores principales: Darcy, John L., Amend, Anthony S., Swift, Sean O. I., Sommers, Pacifica S., Lozupone, Catherine A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9233361/
https://www.ncbi.nlm.nih.gov/pubmed/35752802
http://dx.doi.org/10.1186/s40793-022-00426-0
_version_ 1784735747083862016
author Darcy, John L.
Amend, Anthony S.
Swift, Sean O. I.
Sommers, Pacifica S.
Lozupone, Catherine A.
author_facet Darcy, John L.
Amend, Anthony S.
Swift, Sean O. I.
Sommers, Pacifica S.
Lozupone, Catherine A.
author_sort Darcy, John L.
collection PubMed
description BACKGROUND: Understanding the factors that influence microbes’ environmental distributions is important for determining drivers of microbial community composition. These include environmental variables like temperature and pH, and higher-dimensional variables like geographic distance and host species phylogeny. In microbial ecology, “specificity” is often described in the context of symbiotic or host parasitic interactions, but specificity can be more broadly used to describe the extent to which a species occupies a narrower range of an environmental variable than expected by chance. Using a standardization we describe here, Rao’s (Theor Popul Biol, 1982. https://doi.org/10.1016/0040-5809(82)90004-1, Sankhya A, 2010. https://doi.org/10.1007/s13171-010-0016-3 ) Quadratic Entropy can be conveniently applied to calculate specificity of a feature, such as a species, to many different environmental variables. RESULTS: We present our R package specificity for performing the above analyses, and apply it to four real-life microbial data sets to demonstrate its application. We found that many fungi within the leaves of native Hawaiian plants had strong specificity to rainfall and elevation, even though these variables showed minimal importance in a previous analysis of fungal beta-diversity. In Antarctic cryoconite holes, our tool revealed that many bacteria have specificity to co-occurring algal community composition. Similarly, in the human gut microbiome, many bacteria showed specificity to the composition of bile acids. Finally, our analysis of the Earth Microbiome Project data set showed that most bacteria show strong ontological specificity to sample type. Our software performed as expected on synthetic data as well. CONCLUSIONS: specificity is well-suited to analysis of microbiome data, both in synthetic test cases, and across multiple environment types and experimental designs. The analysis and software we present here can reveal patterns in microbial taxa that may not be evident from a community-level perspective. These insights can also be visualized and interactively shared among researchers using specificity’s companion package, specificity.shiny. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40793-022-00426-0.
format Online
Article
Text
id pubmed-9233361
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-92333612022-06-26 specificity: an R package for analysis of feature specificity to environmental and higher dimensional variables, applied to microbiome species data Darcy, John L. Amend, Anthony S. Swift, Sean O. I. Sommers, Pacifica S. Lozupone, Catherine A. Environ Microbiome Software BACKGROUND: Understanding the factors that influence microbes’ environmental distributions is important for determining drivers of microbial community composition. These include environmental variables like temperature and pH, and higher-dimensional variables like geographic distance and host species phylogeny. In microbial ecology, “specificity” is often described in the context of symbiotic or host parasitic interactions, but specificity can be more broadly used to describe the extent to which a species occupies a narrower range of an environmental variable than expected by chance. Using a standardization we describe here, Rao’s (Theor Popul Biol, 1982. https://doi.org/10.1016/0040-5809(82)90004-1, Sankhya A, 2010. https://doi.org/10.1007/s13171-010-0016-3 ) Quadratic Entropy can be conveniently applied to calculate specificity of a feature, such as a species, to many different environmental variables. RESULTS: We present our R package specificity for performing the above analyses, and apply it to four real-life microbial data sets to demonstrate its application. We found that many fungi within the leaves of native Hawaiian plants had strong specificity to rainfall and elevation, even though these variables showed minimal importance in a previous analysis of fungal beta-diversity. In Antarctic cryoconite holes, our tool revealed that many bacteria have specificity to co-occurring algal community composition. Similarly, in the human gut microbiome, many bacteria showed specificity to the composition of bile acids. Finally, our analysis of the Earth Microbiome Project data set showed that most bacteria show strong ontological specificity to sample type. Our software performed as expected on synthetic data as well. CONCLUSIONS: specificity is well-suited to analysis of microbiome data, both in synthetic test cases, and across multiple environment types and experimental designs. The analysis and software we present here can reveal patterns in microbial taxa that may not be evident from a community-level perspective. These insights can also be visualized and interactively shared among researchers using specificity’s companion package, specificity.shiny. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s40793-022-00426-0. BioMed Central 2022-06-25 /pmc/articles/PMC9233361/ /pubmed/35752802 http://dx.doi.org/10.1186/s40793-022-00426-0 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Software
Darcy, John L.
Amend, Anthony S.
Swift, Sean O. I.
Sommers, Pacifica S.
Lozupone, Catherine A.
specificity: an R package for analysis of feature specificity to environmental and higher dimensional variables, applied to microbiome species data
title specificity: an R package for analysis of feature specificity to environmental and higher dimensional variables, applied to microbiome species data
title_full specificity: an R package for analysis of feature specificity to environmental and higher dimensional variables, applied to microbiome species data
title_fullStr specificity: an R package for analysis of feature specificity to environmental and higher dimensional variables, applied to microbiome species data
title_full_unstemmed specificity: an R package for analysis of feature specificity to environmental and higher dimensional variables, applied to microbiome species data
title_short specificity: an R package for analysis of feature specificity to environmental and higher dimensional variables, applied to microbiome species data
title_sort specificity: an r package for analysis of feature specificity to environmental and higher dimensional variables, applied to microbiome species data
topic Software
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9233361/
https://www.ncbi.nlm.nih.gov/pubmed/35752802
http://dx.doi.org/10.1186/s40793-022-00426-0
work_keys_str_mv AT darcyjohnl specificityanrpackageforanalysisoffeaturespecificitytoenvironmentalandhigherdimensionalvariablesappliedtomicrobiomespeciesdata
AT amendanthonys specificityanrpackageforanalysisoffeaturespecificitytoenvironmentalandhigherdimensionalvariablesappliedtomicrobiomespeciesdata
AT swiftseanoi specificityanrpackageforanalysisoffeaturespecificitytoenvironmentalandhigherdimensionalvariablesappliedtomicrobiomespeciesdata
AT sommerspacificas specificityanrpackageforanalysisoffeaturespecificitytoenvironmentalandhigherdimensionalvariablesappliedtomicrobiomespeciesdata
AT lozuponecatherinea specificityanrpackageforanalysisoffeaturespecificitytoenvironmentalandhigherdimensionalvariablesappliedtomicrobiomespeciesdata