Cargando…

Integrating Semantic Information into Multiple Kernels for Protein-Protein Interaction Extraction from Biomedical Literatures

Protein-Protein Interaction (PPI) extraction is an important task in the biomedical information extraction. Presently, many machine learning methods for PPI extraction have achieved promising results. However, the performance is still not satisfactory. One reason is that the semantic resources were...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Lishuang, Zhang, Panpan, Zheng, Tianfu, Zhang, Hongying, Jiang, Zhenchao, Huang, Degen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3951470/
https://www.ncbi.nlm.nih.gov/pubmed/24622773
http://dx.doi.org/10.1371/journal.pone.0091898
_version_ 1782307125868888064
author Li, Lishuang
Zhang, Panpan
Zheng, Tianfu
Zhang, Hongying
Jiang, Zhenchao
Huang, Degen
author_facet Li, Lishuang
Zhang, Panpan
Zheng, Tianfu
Zhang, Hongying
Jiang, Zhenchao
Huang, Degen
author_sort Li, Lishuang
collection PubMed
description Protein-Protein Interaction (PPI) extraction is an important task in the biomedical information extraction. Presently, many machine learning methods for PPI extraction have achieved promising results. However, the performance is still not satisfactory. One reason is that the semantic resources were basically ignored. In this paper, we propose a multiple-kernel learning-based approach to extract PPIs, combining the feature-based kernel, tree kernel and semantic kernel. Particularly, we extend the shortest path-enclosed tree kernel (SPT) by a dynamic extended strategy to retrieve the richer syntactic information. Our semantic kernel calculates the protein-protein pair similarity and the context similarity based on two semantic resources: WordNet and Medical Subject Heading (MeSH). We evaluate our method with Support Vector Machine (SVM) and achieve an F-score of 69.40% and an AUC of 92.00%, which show that our method outperforms most of the state-of-the-art systems by integrating semantic information.
format Online
Article
Text
id pubmed-3951470
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39514702014-03-13 Integrating Semantic Information into Multiple Kernels for Protein-Protein Interaction Extraction from Biomedical Literatures Li, Lishuang Zhang, Panpan Zheng, Tianfu Zhang, Hongying Jiang, Zhenchao Huang, Degen PLoS One Research Article Protein-Protein Interaction (PPI) extraction is an important task in the biomedical information extraction. Presently, many machine learning methods for PPI extraction have achieved promising results. However, the performance is still not satisfactory. One reason is that the semantic resources were basically ignored. In this paper, we propose a multiple-kernel learning-based approach to extract PPIs, combining the feature-based kernel, tree kernel and semantic kernel. Particularly, we extend the shortest path-enclosed tree kernel (SPT) by a dynamic extended strategy to retrieve the richer syntactic information. Our semantic kernel calculates the protein-protein pair similarity and the context similarity based on two semantic resources: WordNet and Medical Subject Heading (MeSH). We evaluate our method with Support Vector Machine (SVM) and achieve an F-score of 69.40% and an AUC of 92.00%, which show that our method outperforms most of the state-of-the-art systems by integrating semantic information. Public Library of Science 2014-03-12 /pmc/articles/PMC3951470/ /pubmed/24622773 http://dx.doi.org/10.1371/journal.pone.0091898 Text en © 2014 Li et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Li, Lishuang
Zhang, Panpan
Zheng, Tianfu
Zhang, Hongying
Jiang, Zhenchao
Huang, Degen
Integrating Semantic Information into Multiple Kernels for Protein-Protein Interaction Extraction from Biomedical Literatures
title Integrating Semantic Information into Multiple Kernels for Protein-Protein Interaction Extraction from Biomedical Literatures
title_full Integrating Semantic Information into Multiple Kernels for Protein-Protein Interaction Extraction from Biomedical Literatures
title_fullStr Integrating Semantic Information into Multiple Kernels for Protein-Protein Interaction Extraction from Biomedical Literatures
title_full_unstemmed Integrating Semantic Information into Multiple Kernels for Protein-Protein Interaction Extraction from Biomedical Literatures
title_short Integrating Semantic Information into Multiple Kernels for Protein-Protein Interaction Extraction from Biomedical Literatures
title_sort integrating semantic information into multiple kernels for protein-protein interaction extraction from biomedical literatures
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3951470/
https://www.ncbi.nlm.nih.gov/pubmed/24622773
http://dx.doi.org/10.1371/journal.pone.0091898
work_keys_str_mv AT lilishuang integratingsemanticinformationintomultiplekernelsforproteinproteininteractionextractionfrombiomedicalliteratures
AT zhangpanpan integratingsemanticinformationintomultiplekernelsforproteinproteininteractionextractionfrombiomedicalliteratures
AT zhengtianfu integratingsemanticinformationintomultiplekernelsforproteinproteininteractionextractionfrombiomedicalliteratures
AT zhanghongying integratingsemanticinformationintomultiplekernelsforproteinproteininteractionextractionfrombiomedicalliteratures
AT jiangzhenchao integratingsemanticinformationintomultiplekernelsforproteinproteininteractionextractionfrombiomedicalliteratures
AT huangdegen integratingsemanticinformationintomultiplekernelsforproteinproteininteractionextractionfrombiomedicalliteratures