Cargando…

Biblio-MetReS for user-friendly mining of genes and biological processes in scientific documents

One way to initiate the reconstruction of molecular circuits is by using automated text-mining techniques. Developing more efficient methods for such reconstruction is a topic of active research, and those methods are typically included by bioinformaticians in pipelines used to mine and curate large...

Descripción completa

Detalles Bibliográficos
Autores principales: Usie, Anabel, Karathia, Hiren, Teixidó, Ivan, Alves, Rui, Solsona, Francesc
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3940481/
https://www.ncbi.nlm.nih.gov/pubmed/24688854
http://dx.doi.org/10.7717/peerj.276
_version_ 1782305794807562240
author Usie, Anabel
Karathia, Hiren
Teixidó, Ivan
Alves, Rui
Solsona, Francesc
author_facet Usie, Anabel
Karathia, Hiren
Teixidó, Ivan
Alves, Rui
Solsona, Francesc
author_sort Usie, Anabel
collection PubMed
description One way to initiate the reconstruction of molecular circuits is by using automated text-mining techniques. Developing more efficient methods for such reconstruction is a topic of active research, and those methods are typically included by bioinformaticians in pipelines used to mine and curate large literature datasets. Nevertheless, experimental biologists have a limited number of available user-friendly tools that use text-mining for network reconstruction and require no programming skills to use. One of these tools is Biblio-MetReS. Originally, this tool permitted an on-the-fly analysis of documents contained in a number of web-based literature databases to identify co-occurrence of proteins/genes. This approach ensured results that were always up-to-date with the latest live version of the databases. However, this ‘up-to-dateness’ came at the cost of large execution times. Here we report an evolution of the application Biblio-MetReS that permits constructing co-occurrence networks for genes, GO processes, Pathways, or any combination of the three types of entities and graphically represent those entities. We show that the performance of Biblio-MetReS in identifying gene co-occurrence is as least as good as that of other comparable applications (STRING and iHOP). In addition, we also show that the identification of GO processes is on par to that reported in the latest BioCreAtIvE challenge. Finally, we also report the implementation of a new strategy that combines on-the-fly analysis of new documents with preprocessed information from documents that were encountered in previous analyses. This combination simultaneously decreases program run time and maintains ‘up-to-dateness’ of the results. Availability: http://metres.udl.cat/index.php/downloads, Contact: metres.cmb@gmail.com.
format Online
Article
Text
id pubmed-3940481
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-39404812014-03-31 Biblio-MetReS for user-friendly mining of genes and biological processes in scientific documents Usie, Anabel Karathia, Hiren Teixidó, Ivan Alves, Rui Solsona, Francesc PeerJ Bioinformatics One way to initiate the reconstruction of molecular circuits is by using automated text-mining techniques. Developing more efficient methods for such reconstruction is a topic of active research, and those methods are typically included by bioinformaticians in pipelines used to mine and curate large literature datasets. Nevertheless, experimental biologists have a limited number of available user-friendly tools that use text-mining for network reconstruction and require no programming skills to use. One of these tools is Biblio-MetReS. Originally, this tool permitted an on-the-fly analysis of documents contained in a number of web-based literature databases to identify co-occurrence of proteins/genes. This approach ensured results that were always up-to-date with the latest live version of the databases. However, this ‘up-to-dateness’ came at the cost of large execution times. Here we report an evolution of the application Biblio-MetReS that permits constructing co-occurrence networks for genes, GO processes, Pathways, or any combination of the three types of entities and graphically represent those entities. We show that the performance of Biblio-MetReS in identifying gene co-occurrence is as least as good as that of other comparable applications (STRING and iHOP). In addition, we also show that the identification of GO processes is on par to that reported in the latest BioCreAtIvE challenge. Finally, we also report the implementation of a new strategy that combines on-the-fly analysis of new documents with preprocessed information from documents that were encountered in previous analyses. This combination simultaneously decreases program run time and maintains ‘up-to-dateness’ of the results. Availability: http://metres.udl.cat/index.php/downloads, Contact: metres.cmb@gmail.com. PeerJ Inc. 2014-02-27 /pmc/articles/PMC3940481/ /pubmed/24688854 http://dx.doi.org/10.7717/peerj.276 Text en © 2014 Usie et al. http://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Bioinformatics
Usie, Anabel
Karathia, Hiren
Teixidó, Ivan
Alves, Rui
Solsona, Francesc
Biblio-MetReS for user-friendly mining of genes and biological processes in scientific documents
title Biblio-MetReS for user-friendly mining of genes and biological processes in scientific documents
title_full Biblio-MetReS for user-friendly mining of genes and biological processes in scientific documents
title_fullStr Biblio-MetReS for user-friendly mining of genes and biological processes in scientific documents
title_full_unstemmed Biblio-MetReS for user-friendly mining of genes and biological processes in scientific documents
title_short Biblio-MetReS for user-friendly mining of genes and biological processes in scientific documents
title_sort biblio-metres for user-friendly mining of genes and biological processes in scientific documents
topic Bioinformatics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3940481/
https://www.ncbi.nlm.nih.gov/pubmed/24688854
http://dx.doi.org/10.7717/peerj.276
work_keys_str_mv AT usieanabel bibliometresforuserfriendlyminingofgenesandbiologicalprocessesinscientificdocuments
AT karathiahiren bibliometresforuserfriendlyminingofgenesandbiologicalprocessesinscientificdocuments
AT teixidoivan bibliometresforuserfriendlyminingofgenesandbiologicalprocessesinscientificdocuments
AT alvesrui bibliometresforuserfriendlyminingofgenesandbiologicalprocessesinscientificdocuments
AT solsonafrancesc bibliometresforuserfriendlyminingofgenesandbiologicalprocessesinscientificdocuments