Cargando…

An Algorithm for Identifying Novel Targets of Transcription Factor Families: Application to Hypoxia-inducible Factor 1 Targets

Efficient and effective analysis of the growing genomic databases requires the development of adequate computational tools. We introduce a fast method based on the suffix tree data structure for predicting novel targets of hypoxia-inducible factor 1 (HIF-1) from huge genome databases. The suffix tre...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Yue, Cukic, Bojan, Adjeroh, Donald A., Skinner, Heath D., Lin, Jie, Shen, Qingxi J., Jiang, Bing-Hua
Formato: Texto
Lenguaje:English
Publicado: Libertas Academica 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2664698/
https://www.ncbi.nlm.nih.gov/pubmed/19352460
_version_ 1782165979294334976
author Jiang, Yue
Cukic, Bojan
Adjeroh, Donald A.
Skinner, Heath D.
Lin, Jie
Shen, Qingxi J.
Jiang, Bing-Hua
author_facet Jiang, Yue
Cukic, Bojan
Adjeroh, Donald A.
Skinner, Heath D.
Lin, Jie
Shen, Qingxi J.
Jiang, Bing-Hua
author_sort Jiang, Yue
collection PubMed
description Efficient and effective analysis of the growing genomic databases requires the development of adequate computational tools. We introduce a fast method based on the suffix tree data structure for predicting novel targets of hypoxia-inducible factor 1 (HIF-1) from huge genome databases. The suffix tree data structure has two powerful applications here: one is to extract unknown patterns from multiple strings/sequences in linear time; the other is to search multiple strings/sequences using multiple patterns in linear time. Using 15 known HIF-1 target gene sequences as a training set, we extracted 105 common patterns that all occur in the 15 training genes using suffix trees. Using these 105 common patterns along with known subsequences surrounding HIF-1 binding sites from the literature, the algorithm searches a genome database that contains 2,078,786 DNA sequences. It reported 258 potentially novel HIF-1 targets including 25 known HIF-1 targets. Based on microarray studies from the literature, 17 putative genes were confirmed to be upregulated by HIF-1 or hypoxia inside these 258 genes. We further studied one of the potential targets, COX-2, in the biological lab; and showed that it was a biologically relevant HIF-1 target. These results demonstrate that our methodology is an effective computational approach for identifying novel HIF-1 targets.
format Text
id pubmed-2664698
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-26646982009-04-07 An Algorithm for Identifying Novel Targets of Transcription Factor Families: Application to Hypoxia-inducible Factor 1 Targets Jiang, Yue Cukic, Bojan Adjeroh, Donald A. Skinner, Heath D. Lin, Jie Shen, Qingxi J. Jiang, Bing-Hua Cancer Inform Original Research Efficient and effective analysis of the growing genomic databases requires the development of adequate computational tools. We introduce a fast method based on the suffix tree data structure for predicting novel targets of hypoxia-inducible factor 1 (HIF-1) from huge genome databases. The suffix tree data structure has two powerful applications here: one is to extract unknown patterns from multiple strings/sequences in linear time; the other is to search multiple strings/sequences using multiple patterns in linear time. Using 15 known HIF-1 target gene sequences as a training set, we extracted 105 common patterns that all occur in the 15 training genes using suffix trees. Using these 105 common patterns along with known subsequences surrounding HIF-1 binding sites from the literature, the algorithm searches a genome database that contains 2,078,786 DNA sequences. It reported 258 potentially novel HIF-1 targets including 25 known HIF-1 targets. Based on microarray studies from the literature, 17 putative genes were confirmed to be upregulated by HIF-1 or hypoxia inside these 258 genes. We further studied one of the potential targets, COX-2, in the biological lab; and showed that it was a biologically relevant HIF-1 target. These results demonstrate that our methodology is an effective computational approach for identifying novel HIF-1 targets. Libertas Academica 2009-03-04 /pmc/articles/PMC2664698/ /pubmed/19352460 Text en © 2009 The authors. http://creativecommons.org/licenses/by/3.0 This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
spellingShingle Original Research
Jiang, Yue
Cukic, Bojan
Adjeroh, Donald A.
Skinner, Heath D.
Lin, Jie
Shen, Qingxi J.
Jiang, Bing-Hua
An Algorithm for Identifying Novel Targets of Transcription Factor Families: Application to Hypoxia-inducible Factor 1 Targets
title An Algorithm for Identifying Novel Targets of Transcription Factor Families: Application to Hypoxia-inducible Factor 1 Targets
title_full An Algorithm for Identifying Novel Targets of Transcription Factor Families: Application to Hypoxia-inducible Factor 1 Targets
title_fullStr An Algorithm for Identifying Novel Targets of Transcription Factor Families: Application to Hypoxia-inducible Factor 1 Targets
title_full_unstemmed An Algorithm for Identifying Novel Targets of Transcription Factor Families: Application to Hypoxia-inducible Factor 1 Targets
title_short An Algorithm for Identifying Novel Targets of Transcription Factor Families: Application to Hypoxia-inducible Factor 1 Targets
title_sort algorithm for identifying novel targets of transcription factor families: application to hypoxia-inducible factor 1 targets
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2664698/
https://www.ncbi.nlm.nih.gov/pubmed/19352460
work_keys_str_mv AT jiangyue analgorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets
AT cukicbojan analgorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets
AT adjerohdonalda analgorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets
AT skinnerheathd analgorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets
AT linjie analgorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets
AT shenqingxij analgorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets
AT jiangbinghua analgorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets
AT jiangyue algorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets
AT cukicbojan algorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets
AT adjerohdonalda algorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets
AT skinnerheathd algorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets
AT linjie algorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets
AT shenqingxij algorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets
AT jiangbinghua algorithmforidentifyingnoveltargetsoftranscriptionfactorfamiliesapplicationtohypoxiainduciblefactor1targets