Cargando…

Refining biome labeling for large-scale microbial community samples: Leveraging neural networks and transfer learning

Microbiome research has generated an extensive amount of data, resulting in a wealth of publicly accessible samples. Accurate annotation of these samples is crucial for effectively utilizing microbiome data across scientific disciplines. However, a notable challenge arises from the lack of essential...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Nan, Wang, Teng, Ning, Kang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10457426/
https://www.ncbi.nlm.nih.gov/pubmed/37635952
http://dx.doi.org/10.1016/j.ese.2023.100304
_version_ 1785096922108788736
author Wang, Nan
Wang, Teng
Ning, Kang
author_facet Wang, Nan
Wang, Teng
Ning, Kang
author_sort Wang, Nan
collection PubMed
description Microbiome research has generated an extensive amount of data, resulting in a wealth of publicly accessible samples. Accurate annotation of these samples is crucial for effectively utilizing microbiome data across scientific disciplines. However, a notable challenge arises from the lack of essential annotations, particularly regarding collection location and sample biome information, which significantly hinders environmental microbiome research. In this study, we introduce Meta-Sorter, a novel approach utilizing neural networks and transfer learning, to enhance biome labeling for thousands of microbiome samples in the MGnify database that have incomplete information. Our findings demonstrate that Meta-Sorter achieved a remarkable accuracy rate of 96.7% in classifying samples among the 16,507 lacking detailed biome annotations. Notably, Meta-Sorter provides precise classifications for representative environmental samples that were previously ambiguously labeled as “Marine” in MGnify, thereby elucidating their specific origins in benthic and water column environments. Moreover, Meta-Sorter effectively distinguishes samples derived from human-environment interactions, enabling clear differentiation between environmental and human-related studies. By improving the completeness of biome label information for numerous microbial community samples, our research facilitates more accurate knowledge discovery across diverse disciplines, with particular implications for environmental research.
format Online
Article
Text
id pubmed-10457426
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-104574262023-08-27 Refining biome labeling for large-scale microbial community samples: Leveraging neural networks and transfer learning Wang, Nan Wang, Teng Ning, Kang Environ Sci Ecotechnol Original Research Microbiome research has generated an extensive amount of data, resulting in a wealth of publicly accessible samples. Accurate annotation of these samples is crucial for effectively utilizing microbiome data across scientific disciplines. However, a notable challenge arises from the lack of essential annotations, particularly regarding collection location and sample biome information, which significantly hinders environmental microbiome research. In this study, we introduce Meta-Sorter, a novel approach utilizing neural networks and transfer learning, to enhance biome labeling for thousands of microbiome samples in the MGnify database that have incomplete information. Our findings demonstrate that Meta-Sorter achieved a remarkable accuracy rate of 96.7% in classifying samples among the 16,507 lacking detailed biome annotations. Notably, Meta-Sorter provides precise classifications for representative environmental samples that were previously ambiguously labeled as “Marine” in MGnify, thereby elucidating their specific origins in benthic and water column environments. Moreover, Meta-Sorter effectively distinguishes samples derived from human-environment interactions, enabling clear differentiation between environmental and human-related studies. By improving the completeness of biome label information for numerous microbial community samples, our research facilitates more accurate knowledge discovery across diverse disciplines, with particular implications for environmental research. Elsevier 2023-07-26 /pmc/articles/PMC10457426/ /pubmed/37635952 http://dx.doi.org/10.1016/j.ese.2023.100304 Text en © 2023 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Original Research
Wang, Nan
Wang, Teng
Ning, Kang
Refining biome labeling for large-scale microbial community samples: Leveraging neural networks and transfer learning
title Refining biome labeling for large-scale microbial community samples: Leveraging neural networks and transfer learning
title_full Refining biome labeling for large-scale microbial community samples: Leveraging neural networks and transfer learning
title_fullStr Refining biome labeling for large-scale microbial community samples: Leveraging neural networks and transfer learning
title_full_unstemmed Refining biome labeling for large-scale microbial community samples: Leveraging neural networks and transfer learning
title_short Refining biome labeling for large-scale microbial community samples: Leveraging neural networks and transfer learning
title_sort refining biome labeling for large-scale microbial community samples: leveraging neural networks and transfer learning
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10457426/
https://www.ncbi.nlm.nih.gov/pubmed/37635952
http://dx.doi.org/10.1016/j.ese.2023.100304
work_keys_str_mv AT wangnan refiningbiomelabelingforlargescalemicrobialcommunitysamplesleveragingneuralnetworksandtransferlearning
AT wangteng refiningbiomelabelingforlargescalemicrobialcommunitysamplesleveragingneuralnetworksandtransferlearning
AT ningkang refiningbiomelabelingforlargescalemicrobialcommunitysamplesleveragingneuralnetworksandtransferlearning