Cargando…

PathBinder – text empirics and automatic extraction of biomolecular interactions

MOTIVATION: The increasingly large amount of free, online biological text makes automatic interaction extraction correspondingly attractive. Machine learning is one strategy that works by uncovering and using useful properties that are implicit in the text. However these properties are usually not r...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Lifeng, Berleant, Daniel, Ding, Jing, Cao, Tuan, Syrkin Wurtele, Eve
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3226189/
https://www.ncbi.nlm.nih.gov/pubmed/19811683
http://dx.doi.org/10.1186/1471-2105-10-S11-S18
Descripción
Sumario:MOTIVATION: The increasingly large amount of free, online biological text makes automatic interaction extraction correspondingly attractive. Machine learning is one strategy that works by uncovering and using useful properties that are implicit in the text. However these properties are usually not reported in the literature explicitly. By investigating specific properties of biological text passages in this paper, we aim to facilitate an alternative strategy, the use of text empirics, to support mining of biomedical texts for biomolecular interactions. We report on our application of this approach, and also report some empirical findings about an important class of passages. These may be useful to others who may also wish to use the empirical properties we describe. RESULTS: We manually analyzed syntactic and semantic properties of sentences likely to describe interactions between biomolecules. The resulting empirical data were used to design an algorithm for the PathBinder system to extract biomolecular interactions from texts. PathBinder searches PubMed for sentences describing interactions between two given biomolecules. PathBinder then uses probabilistic methods to combine evidence from multiple relevant sentences in PubMed to assess the relative likelihood of interaction between two arbitrary biomolecules. A biomolecular interaction network was constructed based on those likelihoods. CONCLUSION: The text empirics approach used here supports computationally friendly, performance competitive, automatic extraction of biomolecular interactions from texts. AVAILABILITY: http://www.metnetdb.org/pathbinder.