Cargando…
Extracting rate changes in transcriptional regulation from MEDLINE abstracts
BACKGROUND: Time delays are important factors that are often neglected in gene regulatory network (GRN) inference models. Validating time delays from knowledge bases is a challenge since the vast majority of biological databases do not record temporal information of gene regulations. Biological know...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4015878/ https://www.ncbi.nlm.nih.gov/pubmed/24564647 http://dx.doi.org/10.1186/1471-2105-15-S2-S4 |
_version_ | 1782315418678984704 |
---|---|
author | Liu, Wenting Miao, Kui Li, Guangxia Chang, Kuiyu Zheng, Jie Rajapakse, Jagath C |
author_facet | Liu, Wenting Miao, Kui Li, Guangxia Chang, Kuiyu Zheng, Jie Rajapakse, Jagath C |
author_sort | Liu, Wenting |
collection | PubMed |
description | BACKGROUND: Time delays are important factors that are often neglected in gene regulatory network (GRN) inference models. Validating time delays from knowledge bases is a challenge since the vast majority of biological databases do not record temporal information of gene regulations. Biological knowledge and facts on gene regulations are typically extracted from bio-literature with specialized methods that depend on the regulation task. In this paper, we mine evidences for time delays related to the transcriptional regulation of yeast from the PubMed abstracts. RESULTS: Since the vast majority of abstracts lack quantitative time information, we can only collect qualitative evidences of time delays. Specifically, the speed-up or delay in transcriptional regulation rate can provide evidences for time delays (shorter or longer) in GRN. Thus, we focus on deriving events related to rate changes in transcriptional regulation. A corpus of yeast regulation related abstracts was manually labeled with such events. In order to capture these events automatically, we create an ontology of sub-processes that are likely to result in transcription rate changes by combining textual patterns and biological knowledge. We also propose effective feature extraction methods based on the created ontology to identify the direct evidences with specific details of these events. Our ontologies outperform existing state-of-the-art gene regulation ontologies in the automatic rule learning method applied to our corpus. The proposed deterministic ontology rule-based method can achieve comparable performance to the automatic rule learning method based on decision trees. This demonstrates the effectiveness of our ontology in identifying rate-changing events. We also tested the effectiveness of the proposed feature mining methods on detecting direct evidence of events. Experimental results show that the machine learning method on these features achieves an F1-score of 71.43%. CONCLUSIONS: The manually labeled corpus of events relating to rate changes in transcriptional regulation for yeast is available in https://sites.google.com/site/wentingntu/data. The created ontologies summarized both biological causes of rate changes in transcriptional regulation and corresponding positive and negative textual patterns from the corpus. They are demonstrated to be effective in identifying rate-changing events, which shows the benefits of combining textual patterns and biological knowledge on extracting complex biological events. |
format | Online Article Text |
id | pubmed-4015878 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-40158782014-05-23 Extracting rate changes in transcriptional regulation from MEDLINE abstracts Liu, Wenting Miao, Kui Li, Guangxia Chang, Kuiyu Zheng, Jie Rajapakse, Jagath C BMC Bioinformatics Proceedings BACKGROUND: Time delays are important factors that are often neglected in gene regulatory network (GRN) inference models. Validating time delays from knowledge bases is a challenge since the vast majority of biological databases do not record temporal information of gene regulations. Biological knowledge and facts on gene regulations are typically extracted from bio-literature with specialized methods that depend on the regulation task. In this paper, we mine evidences for time delays related to the transcriptional regulation of yeast from the PubMed abstracts. RESULTS: Since the vast majority of abstracts lack quantitative time information, we can only collect qualitative evidences of time delays. Specifically, the speed-up or delay in transcriptional regulation rate can provide evidences for time delays (shorter or longer) in GRN. Thus, we focus on deriving events related to rate changes in transcriptional regulation. A corpus of yeast regulation related abstracts was manually labeled with such events. In order to capture these events automatically, we create an ontology of sub-processes that are likely to result in transcription rate changes by combining textual patterns and biological knowledge. We also propose effective feature extraction methods based on the created ontology to identify the direct evidences with specific details of these events. Our ontologies outperform existing state-of-the-art gene regulation ontologies in the automatic rule learning method applied to our corpus. The proposed deterministic ontology rule-based method can achieve comparable performance to the automatic rule learning method based on decision trees. This demonstrates the effectiveness of our ontology in identifying rate-changing events. We also tested the effectiveness of the proposed feature mining methods on detecting direct evidence of events. Experimental results show that the machine learning method on these features achieves an F1-score of 71.43%. CONCLUSIONS: The manually labeled corpus of events relating to rate changes in transcriptional regulation for yeast is available in https://sites.google.com/site/wentingntu/data. The created ontologies summarized both biological causes of rate changes in transcriptional regulation and corresponding positive and negative textual patterns from the corpus. They are demonstrated to be effective in identifying rate-changing events, which shows the benefits of combining textual patterns and biological knowledge on extracting complex biological events. BioMed Central 2014-01-24 /pmc/articles/PMC4015878/ /pubmed/24564647 http://dx.doi.org/10.1186/1471-2105-15-S2-S4 Text en Copyright © 2014 Liu et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Proceedings Liu, Wenting Miao, Kui Li, Guangxia Chang, Kuiyu Zheng, Jie Rajapakse, Jagath C Extracting rate changes in transcriptional regulation from MEDLINE abstracts |
title | Extracting rate changes in transcriptional regulation from MEDLINE abstracts |
title_full | Extracting rate changes in transcriptional regulation from MEDLINE abstracts |
title_fullStr | Extracting rate changes in transcriptional regulation from MEDLINE abstracts |
title_full_unstemmed | Extracting rate changes in transcriptional regulation from MEDLINE abstracts |
title_short | Extracting rate changes in transcriptional regulation from MEDLINE abstracts |
title_sort | extracting rate changes in transcriptional regulation from medline abstracts |
topic | Proceedings |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4015878/ https://www.ncbi.nlm.nih.gov/pubmed/24564647 http://dx.doi.org/10.1186/1471-2105-15-S2-S4 |
work_keys_str_mv | AT liuwenting extractingratechangesintranscriptionalregulationfrommedlineabstracts AT miaokui extractingratechangesintranscriptionalregulationfrommedlineabstracts AT liguangxia extractingratechangesintranscriptionalregulationfrommedlineabstracts AT changkuiyu extractingratechangesintranscriptionalregulationfrommedlineabstracts AT zhengjie extractingratechangesintranscriptionalregulationfrommedlineabstracts AT rajapaksejagathc extractingratechangesintranscriptionalregulationfrommedlineabstracts |