
Discovery of protein-protein interactions using a combination of linguistic, statistical and graphical information

BACKGROUND: The rapid publication of important research in the biomedical literature makes it increasingly difficult for researchers to keep current with significant work in their area of interest. RESULTS: This paper reports a scalable method for the discovery of protein-protein interactions in Med...


Bibliographic Details
Main authors: Cooper, James W; Kershenbaum, Aaron
Format: Text
Language: English
Published: BioMed Central 2005
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1164402/
https://www.ncbi.nlm.nih.gov/pubmed/15941473
http://dx.doi.org/10.1186/1471-2105-6-143
author Cooper, James W
Kershenbaum, Aaron
collection PubMed
description BACKGROUND: The rapid publication of important research in the biomedical literature makes it increasingly difficult for researchers to keep current with significant work in their area of interest. RESULTS: This paper reports a scalable method for the discovery of protein-protein interactions in Medline abstracts, using a combination of text analytics, statistical and graphical analysis, and a set of easily implemented rules. Applying these techniques to 12,300 abstracts, a precision of 0.61 and a recall of 0.97 were obtained (f = 0.74), and when allowing for two-hop and three-hop relations discovered by graphical analysis, the precision was 0.74 (f = 0.83). CONCLUSION: This combination of linguistic and statistical approaches appears to provide the highest precision and recall thus far reported in detecting protein-protein relations using text analytic approaches.
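As a rough check on the figures in the description above, the F-measure can be recomputed from the reported precision and recall. This is only a sketch: it assumes that "f" denotes the balanced F1 score (the harmonic mean of precision and recall), which the record does not state, and the function name `f1_score` is illustrative rather than taken from the paper.

```python
def f1_score(precision: float, recall: float) -> float:
    """Balanced F-measure: harmonic mean of precision and recall.

    Assumption: the record's "f" is F1; the record itself does not say so.
    """
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Precision and recall as quoted in the description field of this record.
direct = f1_score(0.61, 0.97)
with_hops = f1_score(0.74, 0.97)

print(f"direct relations: F1 = {direct:.3f}")
print(f"with two-hop and three-hop relations: F1 = {with_hops:.3f}")
```

Under this assumption the values come to about 0.749 and 0.840, slightly above the reported 0.74 and 0.83. The small gap would be consistent with the quoted precision and recall having been rounded, though the record gives no detail on this.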
format Text
id pubmed-1164402
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-1164402 2005-06-29 Discovery of protein-protein interactions using a combination of linguistic, statistical and graphical information Cooper, James W Kershenbaum, Aaron BMC Bioinformatics Methodology Article BACKGROUND: The rapid publication of important research in the biomedical literature makes it increasingly difficult for researchers to keep current with significant work in their area of interest. RESULTS: This paper reports a scalable method for the discovery of protein-protein interactions in Medline abstracts, using a combination of text analytics, statistical and graphical analysis, and a set of easily implemented rules. Applying these techniques to 12,300 abstracts, a precision of 0.61 and a recall of 0.97 were obtained (f = 0.74), and when allowing for two-hop and three-hop relations discovered by graphical analysis, the precision was 0.74 (f = 0.83). CONCLUSION: This combination of linguistic and statistical approaches appears to provide the highest precision and recall thus far reported in detecting protein-protein relations using text analytic approaches. BioMed Central 2005-06-07 /pmc/articles/PMC1164402/ /pubmed/15941473 http://dx.doi.org/10.1186/1471-2105-6-143 Text en Copyright © 2005 Cooper and Kershenbaum; licensee BioMed Central Ltd.
title Discovery of protein-protein interactions using a combination of linguistic, statistical and graphical information
topic Methodology Article