Cargando…

Latent environment allocation of microbial community data

As data for microbial community structures found in various environments has increased, studies have examined the relationship between environmental labels given to retrieved microbial samples and their community structures. However, because environments continuously change over time and space, mixe...

Descripción completa

Detalles Bibliográficos
Autores principales: Higashi, Koichi, Suzuki, Shinya, Kurosawa, Shin, Mori, Hiroshi, Kurokawa, Ken
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6005635/
https://www.ncbi.nlm.nih.gov/pubmed/29874232
http://dx.doi.org/10.1371/journal.pcbi.1006143
_version_ 1783332718756495360
author Higashi, Koichi
Suzuki, Shinya
Kurosawa, Shin
Mori, Hiroshi
Kurokawa, Ken
author_facet Higashi, Koichi
Suzuki, Shinya
Kurosawa, Shin
Mori, Hiroshi
Kurokawa, Ken
author_sort Higashi, Koichi
collection PubMed
description As data for microbial community structures found in various environments has increased, studies have examined the relationship between environmental labels given to retrieved microbial samples and their community structures. However, because environments continuously change over time and space, mixed states of some environments and its effects on community formation should be considered, instead of evaluating effects of discrete environmental categories. Here we applied a hierarchical Bayesian model to paired datasets containing more than 30,000 samples of microbial community structures and sample description documents. From the training results, we extracted latent environmental topics that associate co-occurring microbes with co-occurring word sets among samples. Topics are the core elements of environmental mixtures and the visualization of topic-based samples clarifies the connections of various environments. Based on the model training results, we developed a web application, LEA (Latent Environment Allocation), which provides the way to evaluate typicality and heterogeneity of microbial communities in newly obtained samples without confining environmental categories to be compared. Because topics link words and microbes, LEA also enables to search samples semantically related to the query out of 30,000 microbiome samples.
format Online
Article
Text
id pubmed-6005635
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-60056352018-06-25 Latent environment allocation of microbial community data Higashi, Koichi Suzuki, Shinya Kurosawa, Shin Mori, Hiroshi Kurokawa, Ken PLoS Comput Biol Research Article As data for microbial community structures found in various environments has increased, studies have examined the relationship between environmental labels given to retrieved microbial samples and their community structures. However, because environments continuously change over time and space, mixed states of some environments and its effects on community formation should be considered, instead of evaluating effects of discrete environmental categories. Here we applied a hierarchical Bayesian model to paired datasets containing more than 30,000 samples of microbial community structures and sample description documents. From the training results, we extracted latent environmental topics that associate co-occurring microbes with co-occurring word sets among samples. Topics are the core elements of environmental mixtures and the visualization of topic-based samples clarifies the connections of various environments. Based on the model training results, we developed a web application, LEA (Latent Environment Allocation), which provides the way to evaluate typicality and heterogeneity of microbial communities in newly obtained samples without confining environmental categories to be compared. Because topics link words and microbes, LEA also enables to search samples semantically related to the query out of 30,000 microbiome samples. Public Library of Science 2018-06-06 /pmc/articles/PMC6005635/ /pubmed/29874232 http://dx.doi.org/10.1371/journal.pcbi.1006143 Text en © 2018 Higashi et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Higashi, Koichi
Suzuki, Shinya
Kurosawa, Shin
Mori, Hiroshi
Kurokawa, Ken
Latent environment allocation of microbial community data
title Latent environment allocation of microbial community data
title_full Latent environment allocation of microbial community data
title_fullStr Latent environment allocation of microbial community data
title_full_unstemmed Latent environment allocation of microbial community data
title_short Latent environment allocation of microbial community data
title_sort latent environment allocation of microbial community data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6005635/
https://www.ncbi.nlm.nih.gov/pubmed/29874232
http://dx.doi.org/10.1371/journal.pcbi.1006143
work_keys_str_mv AT higashikoichi latentenvironmentallocationofmicrobialcommunitydata
AT suzukishinya latentenvironmentallocationofmicrobialcommunitydata
AT kurosawashin latentenvironmentallocationofmicrobialcommunitydata
AT morihiroshi latentenvironmentallocationofmicrobialcommunitydata
AT kurokawaken latentenvironmentallocationofmicrobialcommunitydata