Cargando…

Using neural networks to mine text and predict metabolic traits for thousands of microbes

Microbes can metabolize more chemical compounds than any other group of organisms. As a result, their metabolism is of interest to investigators across biology. Despite the interest, information on metabolism of specific microbes is hard to access. Information is buried in text of books and journals...

Descripción completa

Detalles Bibliográficos
Autores principales: Hackmann, Timothy J., Zhang, Bo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7954334/
https://www.ncbi.nlm.nih.gov/pubmed/33651810
http://dx.doi.org/10.1371/journal.pcbi.1008757
_version_ 1783664058959921152
author Hackmann, Timothy J.
Zhang, Bo
author_facet Hackmann, Timothy J.
Zhang, Bo
author_sort Hackmann, Timothy J.
collection PubMed
description Microbes can metabolize more chemical compounds than any other group of organisms. As a result, their metabolism is of interest to investigators across biology. Despite the interest, information on metabolism of specific microbes is hard to access. Information is buried in text of books and journals, and investigators have no easy way to extract it out. Here we investigate if neural networks can extract out this information and predict metabolic traits. For proof of concept, we predicted two traits: whether microbes carry one type of metabolism (fermentation) or produce one metabolite (acetate). We collected written descriptions of 7,021 species of bacteria and archaea from Bergey’s Manual. We read the descriptions and manually identified (labeled) which species were fermentative or produced acetate. We then trained neural networks to predict these labels. In total, we identified 2,364 species as fermentative, and 1,009 species as also producing acetate. Neural networks could predict which species were fermentative with 97.3% accuracy. Accuracy was even higher (98.6%) when predicting species also producing acetate. Phylogenetic trees of species and their traits confirmed that predictions were accurate. Our approach with neural networks can extract information efficiently and accurately. It paves the way for putting more metabolic traits into databases, providing easy access of information to investigators.
format Online
Article
Text
id pubmed-7954334
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-79543342021-03-22 Using neural networks to mine text and predict metabolic traits for thousands of microbes Hackmann, Timothy J. Zhang, Bo PLoS Comput Biol Research Article Microbes can metabolize more chemical compounds than any other group of organisms. As a result, their metabolism is of interest to investigators across biology. Despite the interest, information on metabolism of specific microbes is hard to access. Information is buried in text of books and journals, and investigators have no easy way to extract it out. Here we investigate if neural networks can extract out this information and predict metabolic traits. For proof of concept, we predicted two traits: whether microbes carry one type of metabolism (fermentation) or produce one metabolite (acetate). We collected written descriptions of 7,021 species of bacteria and archaea from Bergey’s Manual. We read the descriptions and manually identified (labeled) which species were fermentative or produced acetate. We then trained neural networks to predict these labels. In total, we identified 2,364 species as fermentative, and 1,009 species as also producing acetate. Neural networks could predict which species were fermentative with 97.3% accuracy. Accuracy was even higher (98.6%) when predicting species also producing acetate. Phylogenetic trees of species and their traits confirmed that predictions were accurate. Our approach with neural networks can extract information efficiently and accurately. It paves the way for putting more metabolic traits into databases, providing easy access of information to investigators. Public Library of Science 2021-03-02 /pmc/articles/PMC7954334/ /pubmed/33651810 http://dx.doi.org/10.1371/journal.pcbi.1008757 Text en © 2021 Hackmann, Zhang http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Hackmann, Timothy J.
Zhang, Bo
Using neural networks to mine text and predict metabolic traits for thousands of microbes
title Using neural networks to mine text and predict metabolic traits for thousands of microbes
title_full Using neural networks to mine text and predict metabolic traits for thousands of microbes
title_fullStr Using neural networks to mine text and predict metabolic traits for thousands of microbes
title_full_unstemmed Using neural networks to mine text and predict metabolic traits for thousands of microbes
title_short Using neural networks to mine text and predict metabolic traits for thousands of microbes
title_sort using neural networks to mine text and predict metabolic traits for thousands of microbes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7954334/
https://www.ncbi.nlm.nih.gov/pubmed/33651810
http://dx.doi.org/10.1371/journal.pcbi.1008757
work_keys_str_mv AT hackmanntimothyj usingneuralnetworkstominetextandpredictmetabolictraitsforthousandsofmicrobes
AT zhangbo usingneuralnetworkstominetextandpredictmetabolictraitsforthousandsofmicrobes