Discovery of protein-protein interactions using a combination of linguistic, statistical and graphical information
BACKGROUND: The rapid publication of important research in the biomedical literature makes it increasingly difficult for researchers to keep current with significant work in their area of interest. RESULTS: This paper reports a scalable method for the discovery of protein-protein interactions in Med...
Main Authors: | Cooper, James W; Kershenbaum, Aaron |
---|---|
Format: | Text |
Language: | English |
Published: | BioMed Central 2005 |
Subjects: | Methodology Article |
Online Access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1164402/ https://www.ncbi.nlm.nih.gov/pubmed/15941473 http://dx.doi.org/10.1186/1471-2105-6-143 |
_version_ | 1782124407328604160 |
---|---|
author | Cooper, James W Kershenbaum, Aaron |
author_facet | Cooper, James W Kershenbaum, Aaron |
author_sort | Cooper, James W |
collection | PubMed |
description | BACKGROUND: The rapid publication of important research in the biomedical literature makes it increasingly difficult for researchers to keep current with significant work in their area of interest. RESULTS: This paper reports a scalable method for the discovery of protein-protein interactions in Medline abstracts, using a combination of text analytics, statistical and graphical analysis, and a set of easily implemented rules. Applying these techniques to 12,300 abstracts, a precision of 0.61 and a recall of 0.97 were obtained (f = 0.74), and when allowing for two-hop and three-hop relations discovered by graphical analysis, the precision was 0.74 (f = 0.83). CONCLUSION: This combination of linguistic and statistical approaches appears to provide the highest precision and recall thus far reported in detecting protein-protein relations using text analytic approaches. |
format | Text |
id | pubmed-1164402 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2005 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-1164402 2005-06-29 Discovery of protein-protein interactions using a combination of linguistic, statistical and graphical information Cooper, James W Kershenbaum, Aaron BMC Bioinformatics Methodology Article BACKGROUND: The rapid publication of important research in the biomedical literature makes it increasingly difficult for researchers to keep current with significant work in their area of interest. RESULTS: This paper reports a scalable method for the discovery of protein-protein interactions in Medline abstracts, using a combination of text analytics, statistical and graphical analysis, and a set of easily implemented rules. Applying these techniques to 12,300 abstracts, a precision of 0.61 and a recall of 0.97 were obtained, (f = 0.74) and when allowing for two-hop and three-hop relations discovered by graphical analysis, the precision was 0.74 (f = 0.83). CONCLUSION: This combination of linguistic and statistical approaches appears to provide the highest precision and recall thus far reported in detecting protein-protein relations using text analytic approaches. BioMed Central 2005-06-07 /pmc/articles/PMC1164402/ /pubmed/15941473 http://dx.doi.org/10.1186/1471-2105-6-143 Text en Copyright © 2005 Cooper and Kershenbaum; licensee BioMed Central Ltd. |
spellingShingle | Methodology Article Cooper, James W Kershenbaum, Aaron Discovery of protein-protein interactions using a combination of linguistic, statistical and graphical information |
title | Discovery of protein-protein interactions using a combination of linguistic, statistical and graphical information |
title_full | Discovery of protein-protein interactions using a combination of linguistic, statistical and graphical information |
title_fullStr | Discovery of protein-protein interactions using a combination of linguistic, statistical and graphical information |
title_full_unstemmed | Discovery of protein-protein interactions using a combination of linguistic, statistical and graphical information |
title_short | Discovery of protein-protein interactions using a combination of linguistic, statistical and graphical information |
title_sort | discovery of protein-protein interactions using a combination of linguistic, statistical and graphical information |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1164402/ https://www.ncbi.nlm.nih.gov/pubmed/15941473 http://dx.doi.org/10.1186/1471-2105-6-143 |
work_keys_str_mv | AT cooperjamesw discoveryofproteinproteininteractionsusingacombinationoflinguisticstatisticalandgraphicalinformation AT kershenbaumaaron discoveryofproteinproteininteractionsusingacombinationoflinguisticstatisticalandgraphicalinformation |
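
The f values quoted in the description can be related to the stated precision and recall. The following is a minimal sketch, assuming that "f" denotes the balanced F-measure (the harmonic mean of precision and recall), which the record does not state explicitly. It also assumes that recall stays at 0.97 for the two-hop and three-hop case, since the description gives only the precision there. The function name `f_measure` is hypothetical. Under these assumptions the recomputed values come out near 0.75 and 0.84, slightly above the printed 0.74 and 0.83; the small gap may simply reflect rounding of the published precision and recall.

```python
def f_measure(precision: float, recall: float) -> float:
    """Balanced F-measure: harmonic mean of precision and recall.

    Assumption: the record's "f" is this balanced form (equal weighting).
    """
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Values taken from the description field of this record.
direct = f_measure(0.61, 0.97)

# Assumption: recall is unchanged (0.97) when multi-hop relations are allowed.
multi_hop = f_measure(0.74, 0.97)

print(f"direct relations:    f = {direct:.4f}")
print(f"with multi-hop:      f = {multi_hop:.4f}")
```

The harmonic mean is pulled toward the smaller of its two inputs, so with recall fixed at a high value, most of the gain in f comes from the rise in precision from 0.61 to 0.74.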