Cargando…
LAITOR - Literature Assistant for Identification of Terms co-Occurrences and Relationships
BACKGROUND: Biological knowledge is represented in scientific literature that often describes the function of genes/proteins (bioentities) in terms of their interactions (biointeractions). Such bioentities are often related to biological concepts of interest that are specific of a determined researc...
Autores principales: | , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3098111/ https://www.ncbi.nlm.nih.gov/pubmed/20122157 http://dx.doi.org/10.1186/1471-2105-11-70 |
_version_ | 1782203920138895360 |
---|---|
author | Barbosa-Silva, Adriano Soldatos, Theodoros G Magalhães, Ivan LF Pavlopoulos, Georgios A Fontaine, Jean-Fred Andrade-Navarro, Miguel A Schneider, Reinhard Ortega, J Miguel |
author_facet | Barbosa-Silva, Adriano Soldatos, Theodoros G Magalhães, Ivan LF Pavlopoulos, Georgios A Fontaine, Jean-Fred Andrade-Navarro, Miguel A Schneider, Reinhard Ortega, J Miguel |
author_sort | Barbosa-Silva, Adriano |
collection | PubMed |
description | BACKGROUND: Biological knowledge is represented in scientific literature that often describes the function of genes/proteins (bioentities) in terms of their interactions (biointeractions). Such bioentities are often related to biological concepts of interest that are specific of a determined research field. Therefore, the study of the current literature about a selected topic deposited in public databases, facilitates the generation of novel hypotheses associating a set of bioentities to a common context. RESULTS: We created a text mining system (LAITOR: Literature Assistant for Identification of Terms co-Occurrences and Relationships) that analyses co-occurrences of bioentities, biointeractions, and other biological terms in MEDLINE abstracts. The method accounts for the position of the co-occurring terms within sentences or abstracts. The system detected abstracts mentioning protein-protein interactions in a standard test (BioCreative II IAS test data) with a precision of 0.82-0.89 and a recall of 0.48-0.70. We illustrate the application of LAITOR to the detection of plant response genes in a dataset of 1000 abstracts relevant to the topic. CONCLUSIONS: Text mining tools combining the extraction of interacting bioentities and biological concepts with network displays can be helpful in developing reasonable hypotheses in different scientific backgrounds. |
format | Text |
id | pubmed-3098111 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-30981112011-05-20 LAITOR - Literature Assistant for Identification of Terms co-Occurrences and Relationships Barbosa-Silva, Adriano Soldatos, Theodoros G Magalhães, Ivan LF Pavlopoulos, Georgios A Fontaine, Jean-Fred Andrade-Navarro, Miguel A Schneider, Reinhard Ortega, J Miguel BMC Bioinformatics Software BACKGROUND: Biological knowledge is represented in scientific literature that often describes the function of genes/proteins (bioentities) in terms of their interactions (biointeractions). Such bioentities are often related to biological concepts of interest that are specific of a determined research field. Therefore, the study of the current literature about a selected topic deposited in public databases, facilitates the generation of novel hypotheses associating a set of bioentities to a common context. RESULTS: We created a text mining system (LAITOR: Literature Assistant for Identification of Terms co-Occurrences and Relationships) that analyses co-occurrences of bioentities, biointeractions, and other biological terms in MEDLINE abstracts. The method accounts for the position of the co-occurring terms within sentences or abstracts. The system detected abstracts mentioning protein-protein interactions in a standard test (BioCreative II IAS test data) with a precision of 0.82-0.89 and a recall of 0.48-0.70. We illustrate the application of LAITOR to the detection of plant response genes in a dataset of 1000 abstracts relevant to the topic. CONCLUSIONS: Text mining tools combining the extraction of interacting bioentities and biological concepts with network displays can be helpful in developing reasonable hypotheses in different scientific backgrounds. BioMed Central 2010-02-01 /pmc/articles/PMC3098111/ /pubmed/20122157 http://dx.doi.org/10.1186/1471-2105-11-70 Text en Copyright ©2010 Barbosa-Silva et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Software Barbosa-Silva, Adriano Soldatos, Theodoros G Magalhães, Ivan LF Pavlopoulos, Georgios A Fontaine, Jean-Fred Andrade-Navarro, Miguel A Schneider, Reinhard Ortega, J Miguel LAITOR - Literature Assistant for Identification of Terms co-Occurrences and Relationships |
title | LAITOR - Literature Assistant for Identification of Terms co-Occurrences and Relationships |
title_full | LAITOR - Literature Assistant for Identification of Terms co-Occurrences and Relationships |
title_fullStr | LAITOR - Literature Assistant for Identification of Terms co-Occurrences and Relationships |
title_full_unstemmed | LAITOR - Literature Assistant for Identification of Terms co-Occurrences and Relationships |
title_short | LAITOR - Literature Assistant for Identification of Terms co-Occurrences and Relationships |
title_sort | laitor - literature assistant for identification of terms co-occurrences and relationships |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3098111/ https://www.ncbi.nlm.nih.gov/pubmed/20122157 http://dx.doi.org/10.1186/1471-2105-11-70 |
work_keys_str_mv | AT barbosasilvaadriano laitorliteratureassistantforidentificationoftermscooccurrencesandrelationships AT soldatostheodorosg laitorliteratureassistantforidentificationoftermscooccurrencesandrelationships AT magalhaesivanlf laitorliteratureassistantforidentificationoftermscooccurrencesandrelationships AT pavlopoulosgeorgiosa laitorliteratureassistantforidentificationoftermscooccurrencesandrelationships AT fontainejeanfred laitorliteratureassistantforidentificationoftermscooccurrencesandrelationships AT andradenavarromiguela laitorliteratureassistantforidentificationoftermscooccurrencesandrelationships AT schneiderreinhard laitorliteratureassistantforidentificationoftermscooccurrencesandrelationships AT ortegajmiguel laitorliteratureassistantforidentificationoftermscooccurrencesandrelationships |