Cargando…

BioTAGME: A Comprehensive Platform for Biological Knowledge Network Analysis

The inference of novel knowledge and new hypotheses from the current literature analysis is crucial in making new scientific discoveries. In bio-medicine, given the enormous amount of literature and knowledge bases available, the automatic gain of knowledge concerning relationships among biological...

Descripción completa

Detalles Bibliográficos
Autores principales: Di Maria, Antonio, Alaimo, Salvatore, Bellomo, Lorenzo, Billeci, Fabrizio, Ferragina, Paolo, Ferro, Alfredo, Pulvirenti, Alfredo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9096447/
https://www.ncbi.nlm.nih.gov/pubmed/35571058
http://dx.doi.org/10.3389/fgene.2022.855739
_version_ 1784705981429579776
author Di Maria, Antonio
Alaimo, Salvatore
Bellomo, Lorenzo
Billeci, Fabrizio
Ferragina, Paolo
Ferro, Alfredo
Pulvirenti, Alfredo
author_facet Di Maria, Antonio
Alaimo, Salvatore
Bellomo, Lorenzo
Billeci, Fabrizio
Ferragina, Paolo
Ferro, Alfredo
Pulvirenti, Alfredo
author_sort Di Maria, Antonio
collection PubMed
description The inference of novel knowledge and new hypotheses from the current literature analysis is crucial in making new scientific discoveries. In bio-medicine, given the enormous amount of literature and knowledge bases available, the automatic gain of knowledge concerning relationships among biological elements, in the form of semantically related terms (or entities), is rising novel research challenges and corresponding applications. In this regard, we propose BioTAGME, a system that combines an entity-annotation framework based on Wikipedia corpus (i.e., TAGME tool) with a network-based inference methodology (i.e., DT-Hybrid). This integration aims to create an extensive Knowledge Graph modeling relations among biological terms and phrases extracted from titles and abstracts of papers available in PubMed. The framework consists of a back-end and a front-end. The back-end is entirely implemented in Scala and runs on top of a Spark cluster that distributes the computing effort among several machines. The front-end is released through the Laravel framework, connected with the Neo4j graph database to store the knowledge graph.
format Online
Article
Text
id pubmed-9096447
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-90964472022-05-13 BioTAGME: A Comprehensive Platform for Biological Knowledge Network Analysis Di Maria, Antonio Alaimo, Salvatore Bellomo, Lorenzo Billeci, Fabrizio Ferragina, Paolo Ferro, Alfredo Pulvirenti, Alfredo Front Genet Genetics The inference of novel knowledge and new hypotheses from the current literature analysis is crucial in making new scientific discoveries. In bio-medicine, given the enormous amount of literature and knowledge bases available, the automatic gain of knowledge concerning relationships among biological elements, in the form of semantically related terms (or entities), is rising novel research challenges and corresponding applications. In this regard, we propose BioTAGME, a system that combines an entity-annotation framework based on Wikipedia corpus (i.e., TAGME tool) with a network-based inference methodology (i.e., DT-Hybrid). This integration aims to create an extensive Knowledge Graph modeling relations among biological terms and phrases extracted from titles and abstracts of papers available in PubMed. The framework consists of a back-end and a front-end. The back-end is entirely implemented in Scala and runs on top of a Spark cluster that distributes the computing effort among several machines. The front-end is released through the Laravel framework, connected with the Neo4j graph database to store the knowledge graph. Frontiers Media S.A. 2022-04-28 /pmc/articles/PMC9096447/ /pubmed/35571058 http://dx.doi.org/10.3389/fgene.2022.855739 Text en Copyright © 2022 Di Maria, Alaimo, Bellomo, Billeci, Ferragina, Ferro and Pulvirenti. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Di Maria, Antonio
Alaimo, Salvatore
Bellomo, Lorenzo
Billeci, Fabrizio
Ferragina, Paolo
Ferro, Alfredo
Pulvirenti, Alfredo
BioTAGME: A Comprehensive Platform for Biological Knowledge Network Analysis
title BioTAGME: A Comprehensive Platform for Biological Knowledge Network Analysis
title_full BioTAGME: A Comprehensive Platform for Biological Knowledge Network Analysis
title_fullStr BioTAGME: A Comprehensive Platform for Biological Knowledge Network Analysis
title_full_unstemmed BioTAGME: A Comprehensive Platform for Biological Knowledge Network Analysis
title_short BioTAGME: A Comprehensive Platform for Biological Knowledge Network Analysis
title_sort biotagme: a comprehensive platform for biological knowledge network analysis
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9096447/
https://www.ncbi.nlm.nih.gov/pubmed/35571058
http://dx.doi.org/10.3389/fgene.2022.855739
work_keys_str_mv AT dimariaantonio biotagmeacomprehensiveplatformforbiologicalknowledgenetworkanalysis
AT alaimosalvatore biotagmeacomprehensiveplatformforbiologicalknowledgenetworkanalysis
AT bellomolorenzo biotagmeacomprehensiveplatformforbiologicalknowledgenetworkanalysis
AT billecifabrizio biotagmeacomprehensiveplatformforbiologicalknowledgenetworkanalysis
AT ferraginapaolo biotagmeacomprehensiveplatformforbiologicalknowledgenetworkanalysis
AT ferroalfredo biotagmeacomprehensiveplatformforbiologicalknowledgenetworkanalysis
AT pulvirentialfredo biotagmeacomprehensiveplatformforbiologicalknowledgenetworkanalysis