Cargando…

Extracting rate changes in transcriptional regulation from MEDLINE abstracts

BACKGROUND: Time delays are important factors that are often neglected in gene regulatory network (GRN) inference models. Validating time delays from knowledge bases is a challenge since the vast majority of biological databases do not record temporal information of gene regulations. Biological know...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Wenting, Miao, Kui, Li, Guangxia, Chang, Kuiyu, Zheng, Jie, Rajapakse, Jagath C
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4015878/
https://www.ncbi.nlm.nih.gov/pubmed/24564647
http://dx.doi.org/10.1186/1471-2105-15-S2-S4
_version_ 1782315418678984704
author Liu, Wenting
Miao, Kui
Li, Guangxia
Chang, Kuiyu
Zheng, Jie
Rajapakse, Jagath C
author_facet Liu, Wenting
Miao, Kui
Li, Guangxia
Chang, Kuiyu
Zheng, Jie
Rajapakse, Jagath C
author_sort Liu, Wenting
collection PubMed
description BACKGROUND: Time delays are important factors that are often neglected in gene regulatory network (GRN) inference models. Validating time delays from knowledge bases is a challenge since the vast majority of biological databases do not record temporal information of gene regulations. Biological knowledge and facts on gene regulations are typically extracted from bio-literature with specialized methods that depend on the regulation task. In this paper, we mine evidences for time delays related to the transcriptional regulation of yeast from the PubMed abstracts. RESULTS: Since the vast majority of abstracts lack quantitative time information, we can only collect qualitative evidences of time delays. Specifically, the speed-up or delay in transcriptional regulation rate can provide evidences for time delays (shorter or longer) in GRN. Thus, we focus on deriving events related to rate changes in transcriptional regulation. A corpus of yeast regulation related abstracts was manually labeled with such events. In order to capture these events automatically, we create an ontology of sub-processes that are likely to result in transcription rate changes by combining textual patterns and biological knowledge. We also propose effective feature extraction methods based on the created ontology to identify the direct evidences with specific details of these events. Our ontologies outperform existing state-of-the-art gene regulation ontologies in the automatic rule learning method applied to our corpus. The proposed deterministic ontology rule-based method can achieve comparable performance to the automatic rule learning method based on decision trees. This demonstrates the effectiveness of our ontology in identifying rate-changing events. We also tested the effectiveness of the proposed feature mining methods on detecting direct evidence of events. Experimental results show that the machine learning method on these features achieves an F1-score of 71.43%. CONCLUSIONS: The manually labeled corpus of events relating to rate changes in transcriptional regulation for yeast is available in https://sites.google.com/site/wentingntu/data. The created ontologies summarized both biological causes of rate changes in transcriptional regulation and corresponding positive and negative textual patterns from the corpus. They are demonstrated to be effective in identifying rate-changing events, which shows the benefits of combining textual patterns and biological knowledge on extracting complex biological events.
format Online
Article
Text
id pubmed-4015878
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40158782014-05-23 Extracting rate changes in transcriptional regulation from MEDLINE abstracts Liu, Wenting Miao, Kui Li, Guangxia Chang, Kuiyu Zheng, Jie Rajapakse, Jagath C BMC Bioinformatics Proceedings BACKGROUND: Time delays are important factors that are often neglected in gene regulatory network (GRN) inference models. Validating time delays from knowledge bases is a challenge since the vast majority of biological databases do not record temporal information of gene regulations. Biological knowledge and facts on gene regulations are typically extracted from bio-literature with specialized methods that depend on the regulation task. In this paper, we mine evidences for time delays related to the transcriptional regulation of yeast from the PubMed abstracts. RESULTS: Since the vast majority of abstracts lack quantitative time information, we can only collect qualitative evidences of time delays. Specifically, the speed-up or delay in transcriptional regulation rate can provide evidences for time delays (shorter or longer) in GRN. Thus, we focus on deriving events related to rate changes in transcriptional regulation. A corpus of yeast regulation related abstracts was manually labeled with such events. In order to capture these events automatically, we create an ontology of sub-processes that are likely to result in transcription rate changes by combining textual patterns and biological knowledge. We also propose effective feature extraction methods based on the created ontology to identify the direct evidences with specific details of these events. Our ontologies outperform existing state-of-the-art gene regulation ontologies in the automatic rule learning method applied to our corpus. The proposed deterministic ontology rule-based method can achieve comparable performance to the automatic rule learning method based on decision trees. This demonstrates the effectiveness of our ontology in identifying rate-changing events. We also tested the effectiveness of the proposed feature mining methods on detecting direct evidence of events. Experimental results show that the machine learning method on these features achieves an F1-score of 71.43%. CONCLUSIONS: The manually labeled corpus of events relating to rate changes in transcriptional regulation for yeast is available in https://sites.google.com/site/wentingntu/data. The created ontologies summarized both biological causes of rate changes in transcriptional regulation and corresponding positive and negative textual patterns from the corpus. They are demonstrated to be effective in identifying rate-changing events, which shows the benefits of combining textual patterns and biological knowledge on extracting complex biological events. BioMed Central 2014-01-24 /pmc/articles/PMC4015878/ /pubmed/24564647 http://dx.doi.org/10.1186/1471-2105-15-S2-S4 Text en Copyright © 2014 Liu et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Liu, Wenting
Miao, Kui
Li, Guangxia
Chang, Kuiyu
Zheng, Jie
Rajapakse, Jagath C
Extracting rate changes in transcriptional regulation from MEDLINE abstracts
title Extracting rate changes in transcriptional regulation from MEDLINE abstracts
title_full Extracting rate changes in transcriptional regulation from MEDLINE abstracts
title_fullStr Extracting rate changes in transcriptional regulation from MEDLINE abstracts
title_full_unstemmed Extracting rate changes in transcriptional regulation from MEDLINE abstracts
title_short Extracting rate changes in transcriptional regulation from MEDLINE abstracts
title_sort extracting rate changes in transcriptional regulation from medline abstracts
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4015878/
https://www.ncbi.nlm.nih.gov/pubmed/24564647
http://dx.doi.org/10.1186/1471-2105-15-S2-S4
work_keys_str_mv AT liuwenting extractingratechangesintranscriptionalregulationfrommedlineabstracts
AT miaokui extractingratechangesintranscriptionalregulationfrommedlineabstracts
AT liguangxia extractingratechangesintranscriptionalregulationfrommedlineabstracts
AT changkuiyu extractingratechangesintranscriptionalregulationfrommedlineabstracts
AT zhengjie extractingratechangesintranscriptionalregulationfrommedlineabstracts
AT rajapaksejagathc extractingratechangesintranscriptionalregulationfrommedlineabstracts