Cargando…

Multiple kernels learning-based biological entity relationship extraction method

BACKGROUND: Automatic extracting protein entity interaction information from biomedical literature can help to build protein relation network and design new drugs. There are more than 20 million literature abstracts included in MEDLINE, which is the most authoritative textual database in the field o...

Descripción completa

Detalles Bibliográficos
Autores principales: Dongliang, Xu, Jingchang, Pan, Bailing, Wang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5763518/
https://www.ncbi.nlm.nih.gov/pubmed/29297359
http://dx.doi.org/10.1186/s13326-017-0138-9
_version_ 1783291899005632512
author Dongliang, Xu
Jingchang, Pan
Bailing, Wang
author_facet Dongliang, Xu
Jingchang, Pan
Bailing, Wang
author_sort Dongliang, Xu
collection PubMed
description BACKGROUND: Automatic extracting protein entity interaction information from biomedical literature can help to build protein relation network and design new drugs. There are more than 20 million literature abstracts included in MEDLINE, which is the most authoritative textual database in the field of biomedicine, and follow an exponential growth over time. This frantic expansion of the biomedical literature can often be difficult to absorb or manually analyze. Thus efficient and automated search engines are necessary to efficiently explore the biomedical literature using text mining techniques. RESULTS: The P, R, and F value of tag graph method in Aimed corpus are 50.82, 69.76, and 58.61%, respectively. The P, R, and F value of tag graph kernel method in other four evaluation corpuses are 2–5% higher than that of all-paths graph kernel. And The P, R and F value of feature kernel and tag graph kernel fuse methods is 53.43, 71.62 and 61.30%, respectively. The P, R and F value of feature kernel and tag graph kernel fuse methods is 55.47, 70.29 and 60.37%, respectively. It indicated that the performance of the two kinds of kernel fusion methods is better than that of simple kernel. CONCLUSION: In comparison with the all-paths graph kernel method, the tag graph kernel method is superior in terms of overall performance. Experiments show that the performance of the multi-kernels method is better than that of the three separate single-kernel method and the dual-mutually fused kernel method used hereof in five corpus sets.
format Online
Article
Text
id pubmed-5763518
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-57635182018-01-17 Multiple kernels learning-based biological entity relationship extraction method Dongliang, Xu Jingchang, Pan Bailing, Wang J Biomed Semantics Research BACKGROUND: Automatic extracting protein entity interaction information from biomedical literature can help to build protein relation network and design new drugs. There are more than 20 million literature abstracts included in MEDLINE, which is the most authoritative textual database in the field of biomedicine, and follow an exponential growth over time. This frantic expansion of the biomedical literature can often be difficult to absorb or manually analyze. Thus efficient and automated search engines are necessary to efficiently explore the biomedical literature using text mining techniques. RESULTS: The P, R, and F value of tag graph method in Aimed corpus are 50.82, 69.76, and 58.61%, respectively. The P, R, and F value of tag graph kernel method in other four evaluation corpuses are 2–5% higher than that of all-paths graph kernel. And The P, R and F value of feature kernel and tag graph kernel fuse methods is 53.43, 71.62 and 61.30%, respectively. The P, R and F value of feature kernel and tag graph kernel fuse methods is 55.47, 70.29 and 60.37%, respectively. It indicated that the performance of the two kinds of kernel fusion methods is better than that of simple kernel. CONCLUSION: In comparison with the all-paths graph kernel method, the tag graph kernel method is superior in terms of overall performance. Experiments show that the performance of the multi-kernels method is better than that of the three separate single-kernel method and the dual-mutually fused kernel method used hereof in five corpus sets. BioMed Central 2017-09-20 /pmc/articles/PMC5763518/ /pubmed/29297359 http://dx.doi.org/10.1186/s13326-017-0138-9 Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Dongliang, Xu
Jingchang, Pan
Bailing, Wang
Multiple kernels learning-based biological entity relationship extraction method
title Multiple kernels learning-based biological entity relationship extraction method
title_full Multiple kernels learning-based biological entity relationship extraction method
title_fullStr Multiple kernels learning-based biological entity relationship extraction method
title_full_unstemmed Multiple kernels learning-based biological entity relationship extraction method
title_short Multiple kernels learning-based biological entity relationship extraction method
title_sort multiple kernels learning-based biological entity relationship extraction method
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5763518/
https://www.ncbi.nlm.nih.gov/pubmed/29297359
http://dx.doi.org/10.1186/s13326-017-0138-9
work_keys_str_mv AT dongliangxu multiplekernelslearningbasedbiologicalentityrelationshipextractionmethod
AT jingchangpan multiplekernelslearningbasedbiologicalentityrelationshipextractionmethod
AT bailingwang multiplekernelslearningbasedbiologicalentityrelationshipextractionmethod