Cargando…

PREGO: A Literature and Data-Mining Resource to Associate Microorganisms, Biological Processes, and Environment Types

To elucidate ecosystem functioning, it is fundamental to recognize what processes occur in which environments (where) and which microorganisms carry them out (who). Here, we present PREGO, a one-stop-shop knowledge base providing such associations. PREGO combines text mining and data integration tec...

Descripción completa

Detalles Bibliográficos
Autores principales: Zafeiropoulos, Haris, Paragkamian, Savvas, Ninidakis, Stelios, Pavlopoulos, Georgios A., Jensen, Lars Juhl, Pafilis, Evangelos
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8879827/
https://www.ncbi.nlm.nih.gov/pubmed/35208748
http://dx.doi.org/10.3390/microorganisms10020293
_version_ 1784659004156280832
author Zafeiropoulos, Haris
Paragkamian, Savvas
Ninidakis, Stelios
Pavlopoulos, Georgios A.
Jensen, Lars Juhl
Pafilis, Evangelos
author_facet Zafeiropoulos, Haris
Paragkamian, Savvas
Ninidakis, Stelios
Pavlopoulos, Georgios A.
Jensen, Lars Juhl
Pafilis, Evangelos
author_sort Zafeiropoulos, Haris
collection PubMed
description To elucidate ecosystem functioning, it is fundamental to recognize what processes occur in which environments (where) and which microorganisms carry them out (who). Here, we present PREGO, a one-stop-shop knowledge base providing such associations. PREGO combines text mining and data integration techniques to mine such what-where-who associations from data and metadata scattered in the scientific literature and in public omics repositories. Microorganisms, biological processes, and environment types are identified and mapped to ontology terms from established community resources. Analyses of comentions in text and co-occurrences in metagenomics data/metadata are performed to extract associations and a level of confidence is assigned to each of them thanks to a scoring scheme. The PREGO knowledge base contains associations for 364,508 microbial taxa, 1090 environmental types, 15,091 biological processes, and 7971 molecular functions with a total of almost 58 million associations. These associations are available through a web portal, an Application Programming Interface (API), and bulk download. By exploring environments and/or processes associated with each other or with microbes, PREGO aims to assist researchers in design and interpretation of experiments and their results. To demonstrate PREGO’s capabilities, a thorough presentation of its web interface is given along with a meta-analysis of experimental results from a lagoon-sediment study of sulfur-cycle related microbes.
format Online
Article
Text
id pubmed-8879827
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-88798272022-02-26 PREGO: A Literature and Data-Mining Resource to Associate Microorganisms, Biological Processes, and Environment Types Zafeiropoulos, Haris Paragkamian, Savvas Ninidakis, Stelios Pavlopoulos, Georgios A. Jensen, Lars Juhl Pafilis, Evangelos Microorganisms Article To elucidate ecosystem functioning, it is fundamental to recognize what processes occur in which environments (where) and which microorganisms carry them out (who). Here, we present PREGO, a one-stop-shop knowledge base providing such associations. PREGO combines text mining and data integration techniques to mine such what-where-who associations from data and metadata scattered in the scientific literature and in public omics repositories. Microorganisms, biological processes, and environment types are identified and mapped to ontology terms from established community resources. Analyses of comentions in text and co-occurrences in metagenomics data/metadata are performed to extract associations and a level of confidence is assigned to each of them thanks to a scoring scheme. The PREGO knowledge base contains associations for 364,508 microbial taxa, 1090 environmental types, 15,091 biological processes, and 7971 molecular functions with a total of almost 58 million associations. These associations are available through a web portal, an Application Programming Interface (API), and bulk download. By exploring environments and/or processes associated with each other or with microbes, PREGO aims to assist researchers in design and interpretation of experiments and their results. To demonstrate PREGO’s capabilities, a thorough presentation of its web interface is given along with a meta-analysis of experimental results from a lagoon-sediment study of sulfur-cycle related microbes. MDPI 2022-01-26 /pmc/articles/PMC8879827/ /pubmed/35208748 http://dx.doi.org/10.3390/microorganisms10020293 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Zafeiropoulos, Haris
Paragkamian, Savvas
Ninidakis, Stelios
Pavlopoulos, Georgios A.
Jensen, Lars Juhl
Pafilis, Evangelos
PREGO: A Literature and Data-Mining Resource to Associate Microorganisms, Biological Processes, and Environment Types
title PREGO: A Literature and Data-Mining Resource to Associate Microorganisms, Biological Processes, and Environment Types
title_full PREGO: A Literature and Data-Mining Resource to Associate Microorganisms, Biological Processes, and Environment Types
title_fullStr PREGO: A Literature and Data-Mining Resource to Associate Microorganisms, Biological Processes, and Environment Types
title_full_unstemmed PREGO: A Literature and Data-Mining Resource to Associate Microorganisms, Biological Processes, and Environment Types
title_short PREGO: A Literature and Data-Mining Resource to Associate Microorganisms, Biological Processes, and Environment Types
title_sort prego: a literature and data-mining resource to associate microorganisms, biological processes, and environment types
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8879827/
https://www.ncbi.nlm.nih.gov/pubmed/35208748
http://dx.doi.org/10.3390/microorganisms10020293
work_keys_str_mv AT zafeiropoulosharis pregoaliteratureanddataminingresourcetoassociatemicroorganismsbiologicalprocessesandenvironmenttypes
AT paragkamiansavvas pregoaliteratureanddataminingresourcetoassociatemicroorganismsbiologicalprocessesandenvironmenttypes
AT ninidakisstelios pregoaliteratureanddataminingresourcetoassociatemicroorganismsbiologicalprocessesandenvironmenttypes
AT pavlopoulosgeorgiosa pregoaliteratureanddataminingresourcetoassociatemicroorganismsbiologicalprocessesandenvironmenttypes
AT jensenlarsjuhl pregoaliteratureanddataminingresourcetoassociatemicroorganismsbiologicalprocessesandenvironmenttypes
AT pafilisevangelos pregoaliteratureanddataminingresourcetoassociatemicroorganismsbiologicalprocessesandenvironmenttypes