Cargando…

LPTK: a linguistic pattern-aware dependency tree kernel approach for the BioCreative VI CHEMPROT task

Identifying the interactions between chemical compounds and genes from biomedical literatures is one of the frequently discussed topics of text mining in the life science field. In this paper, we describe Linguistic Pattern-Aware Dependency Tree Kernel, a linguistic interaction pattern learning meth...

Descripción completa

Detalles Bibliográficos
Autores principales: Warikoo, Neha, Chang, Yung-Chun, Hsu, Wen-Lian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6196310/
https://www.ncbi.nlm.nih.gov/pubmed/30346607
http://dx.doi.org/10.1093/database/bay108
_version_ 1783364531744931840
author Warikoo, Neha
Chang, Yung-Chun
Hsu, Wen-Lian
author_facet Warikoo, Neha
Chang, Yung-Chun
Hsu, Wen-Lian
author_sort Warikoo, Neha
collection PubMed
description Identifying the interactions between chemical compounds and genes from biomedical literatures is one of the frequently discussed topics of text mining in the life science field. In this paper, we describe Linguistic Pattern-Aware Dependency Tree Kernel, a linguistic interaction pattern learning method developed for CHEMPROT task–BioCreative VI, to capture chemical–protein interaction (CPI) patterns within biomedical literatures. We also introduce a framework to integrate these linguistic patterns with smooth partial tree kernel to extract the CPIs. This new method of feature representation models aspects of linguistic probability in geometric representation, which not only optimizes the sufficiency of feature dimension for classification, but also defines features as interpretable contexts rather than long vectors of numbers. In order to test the robustness and efficiency of our system in identifying different kinds of biological interactions, we evaluated our framework on three separate data sets, i.e. CHEMPROT corpus, Chemical–Disease Relation corpus and Protein–Protein Interaction corpus. Corresponding experiment results demonstrate that our method is effective and outperforms several compared systems for each data set.
format Online
Article
Text
id pubmed-6196310
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-61963102018-10-25 LPTK: a linguistic pattern-aware dependency tree kernel approach for the BioCreative VI CHEMPROT task Warikoo, Neha Chang, Yung-Chun Hsu, Wen-Lian Database (Oxford) Original Article Identifying the interactions between chemical compounds and genes from biomedical literatures is one of the frequently discussed topics of text mining in the life science field. In this paper, we describe Linguistic Pattern-Aware Dependency Tree Kernel, a linguistic interaction pattern learning method developed for CHEMPROT task–BioCreative VI, to capture chemical–protein interaction (CPI) patterns within biomedical literatures. We also introduce a framework to integrate these linguistic patterns with smooth partial tree kernel to extract the CPIs. This new method of feature representation models aspects of linguistic probability in geometric representation, which not only optimizes the sufficiency of feature dimension for classification, but also defines features as interpretable contexts rather than long vectors of numbers. In order to test the robustness and efficiency of our system in identifying different kinds of biological interactions, we evaluated our framework on three separate data sets, i.e. CHEMPROT corpus, Chemical–Disease Relation corpus and Protein–Protein Interaction corpus. Corresponding experiment results demonstrate that our method is effective and outperforms several compared systems for each data set. Oxford University Press 2018-10-22 /pmc/articles/PMC6196310/ /pubmed/30346607 http://dx.doi.org/10.1093/database/bay108 Text en © The Author(s) 2018. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Warikoo, Neha
Chang, Yung-Chun
Hsu, Wen-Lian
LPTK: a linguistic pattern-aware dependency tree kernel approach for the BioCreative VI CHEMPROT task
title LPTK: a linguistic pattern-aware dependency tree kernel approach for the BioCreative VI CHEMPROT task
title_full LPTK: a linguistic pattern-aware dependency tree kernel approach for the BioCreative VI CHEMPROT task
title_fullStr LPTK: a linguistic pattern-aware dependency tree kernel approach for the BioCreative VI CHEMPROT task
title_full_unstemmed LPTK: a linguistic pattern-aware dependency tree kernel approach for the BioCreative VI CHEMPROT task
title_short LPTK: a linguistic pattern-aware dependency tree kernel approach for the BioCreative VI CHEMPROT task
title_sort lptk: a linguistic pattern-aware dependency tree kernel approach for the biocreative vi chemprot task
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6196310/
https://www.ncbi.nlm.nih.gov/pubmed/30346607
http://dx.doi.org/10.1093/database/bay108
work_keys_str_mv AT warikooneha lptkalinguisticpatternawaredependencytreekernelapproachforthebiocreativevichemprottask
AT changyungchun lptkalinguisticpatternawaredependencytreekernelapproachforthebiocreativevichemprottask
AT hsuwenlian lptkalinguisticpatternawaredependencytreekernelapproachforthebiocreativevichemprottask