Cargando…

A new systematic computational approach to predicting target genes of transcription factors

Identifying transcription factor target genes (TFTGs) is a vital step towards understanding regulatory mechanisms of gene expression. Methods for the de novo identification of TFTGs are generally based on screening for novel DNA binding sites. However, experimental screening of new binding sites is...

Descripción completa

Detalles Bibliográficos
Autores principales: Dai, Xinbin, He, Ji, Zhao, Xuechun
Formato: Texto
Lenguaje:English
Publicado: Oxford University Press 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1935008/
https://www.ncbi.nlm.nih.gov/pubmed/17576669
http://dx.doi.org/10.1093/nar/gkm454
_version_ 1782134363746467840
author Dai, Xinbin
He, Ji
Zhao, Xuechun
author_facet Dai, Xinbin
He, Ji
Zhao, Xuechun
author_sort Dai, Xinbin
collection PubMed
description Identifying transcription factor target genes (TFTGs) is a vital step towards understanding regulatory mechanisms of gene expression. Methods for the de novo identification of TFTGs are generally based on screening for novel DNA binding sites. However, experimental screening of new binding sites is a technically challenging, laborious and time-consuming task, while computational methods still lack accuracy. We propose a novel systematic computational approach for predicting TFTGs directly on a genome scale. Utilizing gene co-expression data, we modeled the prediction problem as a ‘yes’ or ‘no’ classification task by converting biological sequences into novel reverse-complementary position-sensitive n-gram profiles and implemented the classifiers with support vector machines. Our approach does not necessarily predict new DNA binding sites, which other studies have shown to be difficult and inaccurate. We applied the proposed approach to predict auxin-response factor target genes from published Arabidopsis thaliana co-expression data and obtained satisfactory results. Using ten-fold cross validations, the area under curve value of the receiver operating characteristic reaches around 0.73.
format Text
id pubmed-1935008
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-19350082007-08-07 A new systematic computational approach to predicting target genes of transcription factors Dai, Xinbin He, Ji Zhao, Xuechun Nucleic Acids Res Computational Biology Identifying transcription factor target genes (TFTGs) is a vital step towards understanding regulatory mechanisms of gene expression. Methods for the de novo identification of TFTGs are generally based on screening for novel DNA binding sites. However, experimental screening of new binding sites is a technically challenging, laborious and time-consuming task, while computational methods still lack accuracy. We propose a novel systematic computational approach for predicting TFTGs directly on a genome scale. Utilizing gene co-expression data, we modeled the prediction problem as a ‘yes’ or ‘no’ classification task by converting biological sequences into novel reverse-complementary position-sensitive n-gram profiles and implemented the classifiers with support vector machines. Our approach does not necessarily predict new DNA binding sites, which other studies have shown to be difficult and inaccurate. We applied the proposed approach to predict auxin-response factor target genes from published Arabidopsis thaliana co-expression data and obtained satisfactory results. Using ten-fold cross validations, the area under curve value of the receiver operating characteristic reaches around 0.73. Oxford University Press 2007-07 2007-06-18 /pmc/articles/PMC1935008/ /pubmed/17576669 http://dx.doi.org/10.1093/nar/gkm454 Text en © 2007 The Author(s) http://creativecommons.org/licenses/by-nc/2.0/uk/ This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/2.0/uk/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Computational Biology
Dai, Xinbin
He, Ji
Zhao, Xuechun
A new systematic computational approach to predicting target genes of transcription factors
title A new systematic computational approach to predicting target genes of transcription factors
title_full A new systematic computational approach to predicting target genes of transcription factors
title_fullStr A new systematic computational approach to predicting target genes of transcription factors
title_full_unstemmed A new systematic computational approach to predicting target genes of transcription factors
title_short A new systematic computational approach to predicting target genes of transcription factors
title_sort new systematic computational approach to predicting target genes of transcription factors
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1935008/
https://www.ncbi.nlm.nih.gov/pubmed/17576669
http://dx.doi.org/10.1093/nar/gkm454
work_keys_str_mv AT daixinbin anewsystematiccomputationalapproachtopredictingtargetgenesoftranscriptionfactors
AT heji anewsystematiccomputationalapproachtopredictingtargetgenesoftranscriptionfactors
AT zhaoxuechun anewsystematiccomputationalapproachtopredictingtargetgenesoftranscriptionfactors
AT daixinbin newsystematiccomputationalapproachtopredictingtargetgenesoftranscriptionfactors
AT heji newsystematiccomputationalapproachtopredictingtargetgenesoftranscriptionfactors
AT zhaoxuechun newsystematiccomputationalapproachtopredictingtargetgenesoftranscriptionfactors