Cargando…

Walk-weighted subsequence kernels for protein-protein interaction extraction

BACKGROUND: The construction of interaction networks between proteins is central to understanding the underlying biological processes. However, since many useful relations are excluded in databases and remain hidden in raw text, a study on automatic interaction extraction from text is important in b...

Descripción completa

Detalles Bibliográficos
Autores principales: Kim, Seonho, Yoon, Juntae, Yang, Jihoon, Park, Seog
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2844389/
https://www.ncbi.nlm.nih.gov/pubmed/20184736
http://dx.doi.org/10.1186/1471-2105-11-107
_version_ 1782179300289544192
author Kim, Seonho
Yoon, Juntae
Yang, Jihoon
Park, Seog
author_facet Kim, Seonho
Yoon, Juntae
Yang, Jihoon
Park, Seog
author_sort Kim, Seonho
collection PubMed
description BACKGROUND: The construction of interaction networks between proteins is central to understanding the underlying biological processes. However, since many useful relations are excluded in databases and remain hidden in raw text, a study on automatic interaction extraction from text is important in bioinformatics field. RESULTS: Here, we suggest two kinds of kernel methods for genic interaction extraction, considering the structural aspects of sentences. First, we improve our prior dependency kernel by modifying the kernel function so that it can involve various substructures in terms of (1) e-walks, (2) partial match, (3) non-contiguous paths, and (4) different significance of substructures. Second, we propose the walk-weighted subsequence kernel to parameterize non-contiguous syntactic structures as well as semantic roles and lexical features, which makes learning structural aspects from a small amount of training data effective. Furthermore, we distinguish the significances of parameters such as syntactic locality, semantic roles, and lexical features by varying their weights. CONCLUSIONS: We addressed the genic interaction problem with various dependency kernels and suggested various structural kernel scenarios based on the directed shortest dependency path connecting two entities. Consequently, we obtained promising results over genic interaction data sets with the walk-weighted subsequence kernel. The results are compared using automatically parsed third party protein-protein interaction (PPI) data as well as perfectly syntactic labeled PPI data.
format Text
id pubmed-2844389
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28443892010-03-24 Walk-weighted subsequence kernels for protein-protein interaction extraction Kim, Seonho Yoon, Juntae Yang, Jihoon Park, Seog BMC Bioinformatics Research article BACKGROUND: The construction of interaction networks between proteins is central to understanding the underlying biological processes. However, since many useful relations are excluded in databases and remain hidden in raw text, a study on automatic interaction extraction from text is important in bioinformatics field. RESULTS: Here, we suggest two kinds of kernel methods for genic interaction extraction, considering the structural aspects of sentences. First, we improve our prior dependency kernel by modifying the kernel function so that it can involve various substructures in terms of (1) e-walks, (2) partial match, (3) non-contiguous paths, and (4) different significance of substructures. Second, we propose the walk-weighted subsequence kernel to parameterize non-contiguous syntactic structures as well as semantic roles and lexical features, which makes learning structural aspects from a small amount of training data effective. Furthermore, we distinguish the significances of parameters such as syntactic locality, semantic roles, and lexical features by varying their weights. CONCLUSIONS: We addressed the genic interaction problem with various dependency kernels and suggested various structural kernel scenarios based on the directed shortest dependency path connecting two entities. Consequently, we obtained promising results over genic interaction data sets with the walk-weighted subsequence kernel. The results are compared using automatically parsed third party protein-protein interaction (PPI) data as well as perfectly syntactic labeled PPI data. BioMed Central 2010-02-25 /pmc/articles/PMC2844389/ /pubmed/20184736 http://dx.doi.org/10.1186/1471-2105-11-107 Text en Copyright ©2010 Kim et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research article
Kim, Seonho
Yoon, Juntae
Yang, Jihoon
Park, Seog
Walk-weighted subsequence kernels for protein-protein interaction extraction
title Walk-weighted subsequence kernels for protein-protein interaction extraction
title_full Walk-weighted subsequence kernels for protein-protein interaction extraction
title_fullStr Walk-weighted subsequence kernels for protein-protein interaction extraction
title_full_unstemmed Walk-weighted subsequence kernels for protein-protein interaction extraction
title_short Walk-weighted subsequence kernels for protein-protein interaction extraction
title_sort walk-weighted subsequence kernels for protein-protein interaction extraction
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2844389/
https://www.ncbi.nlm.nih.gov/pubmed/20184736
http://dx.doi.org/10.1186/1471-2105-11-107
work_keys_str_mv AT kimseonho walkweightedsubsequencekernelsforproteinproteininteractionextraction
AT yoonjuntae walkweightedsubsequencekernelsforproteinproteininteractionextraction
AT yangjihoon walkweightedsubsequencekernelsforproteinproteininteractionextraction
AT parkseog walkweightedsubsequencekernelsforproteinproteininteractionextraction