Cargando…
An Algorithm for Identifying Novel Targets of Transcription Factor Families: Application to Hypoxia-inducible Factor 1 Targets
Efficient and effective analysis of the growing genomic databases requires the development of adequate computational tools. We introduce a fast method based on the suffix tree data structure for predicting novel targets of hypoxia-inducible factor 1 (HIF-1) from huge genome databases. The suffix tre...
Autores principales: | , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Libertas Academica
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2664698/ https://www.ncbi.nlm.nih.gov/pubmed/19352460 |
_version_ | 1782165979294334976 |
---|---|
author | Jiang, Yue Cukic, Bojan Adjeroh, Donald A. Skinner, Heath D. Lin, Jie Shen, Qingxi J. Jiang, Bing-Hua |
author_facet | Jiang, Yue Cukic, Bojan Adjeroh, Donald A. Skinner, Heath D. Lin, Jie Shen, Qingxi J. Jiang, Bing-Hua |
author_sort | Jiang, Yue |
collection | PubMed |
description | Efficient and effective analysis of the growing genomic databases requires the development of adequate computational tools. We introduce a fast method based on the suffix tree data structure for predicting novel targets of hypoxia-inducible factor 1 (HIF-1) from huge genome databases. The suffix tree data structure has two powerful applications here: one is to extract unknown patterns from multiple strings/sequences in linear time; the other is to search multiple strings/sequences using multiple patterns in linear time. Using 15 known HIF-1 target gene sequences as a training set, we extracted 105 common patterns that all occur in the 15 training genes using suffix trees. Using these 105 common patterns along with known subsequences surrounding HIF-1 binding sites from the literature, the algorithm searches a genome database that contains 2,078,786 DNA sequences. It reported 258 potentially novel HIF-1 targets including 25 known HIF-1 targets. Based on microarray studies from the literature, 17 putative genes were confirmed to be upregulated by HIF-1 or hypoxia inside these 258 genes. We further studied one of the potential targets, COX-2, in the biological lab; and showed that it was a biologically relevant HIF-1 target. These results demonstrate that our methodology is an effective computational approach for identifying novel HIF-1 targets. |
format | Text |
id | pubmed-2664698 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | Libertas Academica |
record_format | MEDLINE/PubMed |
spelling | pubmed-26646982009-04-07 An Algorithm for Identifying Novel Targets of Transcription Factor Families: Application to Hypoxia-inducible Factor 1 Targets Jiang, Yue Cukic, Bojan Adjeroh, Donald A. Skinner, Heath D. Lin, Jie Shen, Qingxi J. Jiang, Bing-Hua Cancer Inform Original Research Efficient and effective analysis of the growing genomic databases requires the development of adequate computational tools. We introduce a fast method based on the suffix tree data structure for predicting novel targets of hypoxia-inducible factor 1 (HIF-1) from huge genome databases. The suffix tree data structure has two powerful applications here: one is to extract unknown patterns from multiple strings/sequences in linear time; the other is to search multiple strings/sequences using multiple patterns in linear time. Using 15 known HIF-1 target gene sequences as a training set, we extracted 105 common patterns that all occur in the 15 training genes using suffix trees. Using these 105 common patterns along with known subsequences surrounding HIF-1 binding sites from the literature, the algorithm searches a genome database that contains 2,078,786 DNA sequences. It reported 258 potentially novel HIF-1 targets including 25 known HIF-1 targets. Based on microarray studies from the literature, 17 putative genes were confirmed to be upregulated by HIF-1 or hypoxia inside these 258 genes. We further studied one of the potential targets, COX-2, in the biological lab; and showed that it was a biologically relevant HIF-1 target. These results demonstrate that our methodology is an effective computational approach for identifying novel HIF-1 targets. Libertas Academica 2009-03-04 /pmc/articles/PMC2664698/ /pubmed/19352460 Text en © 2009 The authors. http://creativecommons.org/licenses/by/3.0 This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/). |
spellingShingle | Original Research Jiang, Yue Cukic, Bojan Adjeroh, Donald A. Skinner, Heath D. Lin, Jie Shen, Qingxi J. Jiang, Bing-Hua An Algorithm for Identifying Novel Targets of Transcription Factor Families: Application to Hypoxia-inducible Factor 1 Targets |
title | An Algorithm for Identifying Novel Targets of Transcription Factor Families: Application to Hypoxia-inducible Factor 1 Targets |
title_full | An Algorithm for Identifying Novel Targets of Transcription Factor Families: Application to Hypoxia-inducible Factor 1 Targets |
title_fullStr | An Algorithm for Identifying Novel Targets of Transcription Factor Families: Application to Hypoxia-inducible Factor 1 Targets |
title_full_unstemmed | An Algorithm for Identifying Novel Targets of Transcription Factor Families: Application to Hypoxia-inducible Factor 1 Targets |
title_short | An Algorithm for Identifying Novel Targets of Transcription Factor Families: Application to Hypoxia-inducible Factor 1 Targets |
title_sort | algorithm for identifying novel targets of transcription factor families: application to hypoxia-inducible factor 1 targets |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2664698/ https://www.ncbi.nlm.nih.gov/pubmed/19352460 |
work_keys_str_mv | AT jiangyue analgorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets AT cukicbojan analgorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets AT adjerohdonalda analgorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets AT skinnerheathd analgorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets AT linjie analgorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets AT shenqingxij analgorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets AT jiangbinghua analgorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets AT jiangyue algorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets AT cukicbojan algorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets AT adjerohdonalda algorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets AT skinnerheathd algorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets AT linjie algorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets AT shenqingxij algorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets AT jiangbinghua algorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets |