Cargando…
Integrating Semantic Information into Multiple Kernels for Protein-Protein Interaction Extraction from Biomedical Literatures
Protein-Protein Interaction (PPI) extraction is an important task in the biomedical information extraction. Presently, many machine learning methods for PPI extraction have achieved promising results. However, the performance is still not satisfactory. One reason is that the semantic resources were...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3951470/ https://www.ncbi.nlm.nih.gov/pubmed/24622773 http://dx.doi.org/10.1371/journal.pone.0091898 |
_version_ | 1782307125868888064 |
---|---|
author | Li, Lishuang Zhang, Panpan Zheng, Tianfu Zhang, Hongying Jiang, Zhenchao Huang, Degen |
author_facet | Li, Lishuang Zhang, Panpan Zheng, Tianfu Zhang, Hongying Jiang, Zhenchao Huang, Degen |
author_sort | Li, Lishuang |
collection | PubMed |
description | Protein-Protein Interaction (PPI) extraction is an important task in the biomedical information extraction. Presently, many machine learning methods for PPI extraction have achieved promising results. However, the performance is still not satisfactory. One reason is that the semantic resources were basically ignored. In this paper, we propose a multiple-kernel learning-based approach to extract PPIs, combining the feature-based kernel, tree kernel and semantic kernel. Particularly, we extend the shortest path-enclosed tree kernel (SPT) by a dynamic extended strategy to retrieve the richer syntactic information. Our semantic kernel calculates the protein-protein pair similarity and the context similarity based on two semantic resources: WordNet and Medical Subject Heading (MeSH). We evaluate our method with Support Vector Machine (SVM) and achieve an F-score of 69.40% and an AUC of 92.00%, which show that our method outperforms most of the state-of-the-art systems by integrating semantic information. |
format | Online Article Text |
id | pubmed-3951470 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-39514702014-03-13 Integrating Semantic Information into Multiple Kernels for Protein-Protein Interaction Extraction from Biomedical Literatures Li, Lishuang Zhang, Panpan Zheng, Tianfu Zhang, Hongying Jiang, Zhenchao Huang, Degen PLoS One Research Article Protein-Protein Interaction (PPI) extraction is an important task in the biomedical information extraction. Presently, many machine learning methods for PPI extraction have achieved promising results. However, the performance is still not satisfactory. One reason is that the semantic resources were basically ignored. In this paper, we propose a multiple-kernel learning-based approach to extract PPIs, combining the feature-based kernel, tree kernel and semantic kernel. Particularly, we extend the shortest path-enclosed tree kernel (SPT) by a dynamic extended strategy to retrieve the richer syntactic information. Our semantic kernel calculates the protein-protein pair similarity and the context similarity based on two semantic resources: WordNet and Medical Subject Heading (MeSH). We evaluate our method with Support Vector Machine (SVM) and achieve an F-score of 69.40% and an AUC of 92.00%, which show that our method outperforms most of the state-of-the-art systems by integrating semantic information. Public Library of Science 2014-03-12 /pmc/articles/PMC3951470/ /pubmed/24622773 http://dx.doi.org/10.1371/journal.pone.0091898 Text en © 2014 Li et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Li, Lishuang Zhang, Panpan Zheng, Tianfu Zhang, Hongying Jiang, Zhenchao Huang, Degen Integrating Semantic Information into Multiple Kernels for Protein-Protein Interaction Extraction from Biomedical Literatures |
title | Integrating Semantic Information into Multiple Kernels for Protein-Protein Interaction Extraction from Biomedical Literatures |
title_full | Integrating Semantic Information into Multiple Kernels for Protein-Protein Interaction Extraction from Biomedical Literatures |
title_fullStr | Integrating Semantic Information into Multiple Kernels for Protein-Protein Interaction Extraction from Biomedical Literatures |
title_full_unstemmed | Integrating Semantic Information into Multiple Kernels for Protein-Protein Interaction Extraction from Biomedical Literatures |
title_short | Integrating Semantic Information into Multiple Kernels for Protein-Protein Interaction Extraction from Biomedical Literatures |
title_sort | integrating semantic information into multiple kernels for protein-protein interaction extraction from biomedical literatures |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3951470/ https://www.ncbi.nlm.nih.gov/pubmed/24622773 http://dx.doi.org/10.1371/journal.pone.0091898 |
work_keys_str_mv | AT lilishuang integratingsemanticinformationintomultiplekernelsforproteinproteininteractionextractionfrombiomedicalliteratures AT zhangpanpan integratingsemanticinformationintomultiplekernelsforproteinproteininteractionextractionfrombiomedicalliteratures AT zhengtianfu integratingsemanticinformationintomultiplekernelsforproteinproteininteractionextractionfrombiomedicalliteratures AT zhanghongying integratingsemanticinformationintomultiplekernelsforproteinproteininteractionextractionfrombiomedicalliteratures AT jiangzhenchao integratingsemanticinformationintomultiplekernelsforproteinproteininteractionextractionfrombiomedicalliteratures AT huangdegen integratingsemanticinformationintomultiplekernelsforproteinproteininteractionextractionfrombiomedicalliteratures |