Cargando…

Detection of Gene Interactions Based on Syntactic Relations

Interactions between proteins and genes are considered essential in the description of biomolecular phenomena, and networks of interactions are applied in a system's biology approach. Recently, many studies have sought to extract information from biomolecular text using natural language process...

Descripción completa

Detalles Bibliográficos
Autor principal: Kim, Mi-Young
Formato: Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2008
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2277490/
https://www.ncbi.nlm.nih.gov/pubmed/18385820
http://dx.doi.org/10.1155/2008/371710
_version_ 1782152038295011328
author Kim, Mi-Young
author_facet Kim, Mi-Young
author_sort Kim, Mi-Young
collection PubMed
description Interactions between proteins and genes are considered essential in the description of biomolecular phenomena, and networks of interactions are applied in a system's biology approach. Recently, many studies have sought to extract information from biomolecular text using natural language processing technology. Previous studies have asserted that linguistic information is useful for improving the detection of gene interactions. In particular, syntactic relations among linguistic information are good for detecting gene interactions. However, previous systems give a reasonably good precision but poor recall. To improve recall without sacrificing precision, this paper proposes a three-phase method for detecting gene interactions based on syntactic relations. In the first phase, we retrieve syntactic encapsulation categories for each candidate agent and target. In the second phase, we construct a verb list that indicates the nature of the interaction between pairs of genes. In the last phase, we determine direction rules to detect which of two genes is the agent or target. Even without biomolecular knowledge, our method performs reasonably well using a small training dataset. While the first phase contributes to improve recall, the second and third phases contribute to improve precision. In the experimental results using ICML 05 Workshop on Learning Language in Logic (LLL05) data, our proposed method gave an F-measure of 67.2% for the test data, significantly outperforming previous methods. We also describe the contribution of each phase to the performance.
format Text
id pubmed-2277490
institution National Center for Biotechnology Information
language English
publishDate 2008
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-22774902008-04-02 Detection of Gene Interactions Based on Syntactic Relations Kim, Mi-Young J Biomed Biotechnol Research Article Interactions between proteins and genes are considered essential in the description of biomolecular phenomena, and networks of interactions are applied in a system's biology approach. Recently, many studies have sought to extract information from biomolecular text using natural language processing technology. Previous studies have asserted that linguistic information is useful for improving the detection of gene interactions. In particular, syntactic relations among linguistic information are good for detecting gene interactions. However, previous systems give a reasonably good precision but poor recall. To improve recall without sacrificing precision, this paper proposes a three-phase method for detecting gene interactions based on syntactic relations. In the first phase, we retrieve syntactic encapsulation categories for each candidate agent and target. In the second phase, we construct a verb list that indicates the nature of the interaction between pairs of genes. In the last phase, we determine direction rules to detect which of two genes is the agent or target. Even without biomolecular knowledge, our method performs reasonably well using a small training dataset. While the first phase contributes to improve recall, the second and third phases contribute to improve precision. In the experimental results using ICML 05 Workshop on Learning Language in Logic (LLL05) data, our proposed method gave an F-measure of 67.2% for the test data, significantly outperforming previous methods. We also describe the contribution of each phase to the performance. Hindawi Publishing Corporation 2008 2008-03-16 /pmc/articles/PMC2277490/ /pubmed/18385820 http://dx.doi.org/10.1155/2008/371710 Text en Copyright © 2008 Mi-Young Kim. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Kim, Mi-Young
Detection of Gene Interactions Based on Syntactic Relations
title Detection of Gene Interactions Based on Syntactic Relations
title_full Detection of Gene Interactions Based on Syntactic Relations
title_fullStr Detection of Gene Interactions Based on Syntactic Relations
title_full_unstemmed Detection of Gene Interactions Based on Syntactic Relations
title_short Detection of Gene Interactions Based on Syntactic Relations
title_sort detection of gene interactions based on syntactic relations
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2277490/
https://www.ncbi.nlm.nih.gov/pubmed/18385820
http://dx.doi.org/10.1155/2008/371710
work_keys_str_mv AT kimmiyoung detectionofgeneinteractionsbasedonsyntacticrelations