A novel ab initio identification system of transcriptional regulation motifs in genome DNA sequences based on direct comparison scheme of signal/noise distributions

Bibliographic Details
Main authors: Nakaki, Ryo; Kang, Jiyoung; Tateno, Masaru
Format: Online Article Text
Language: English
Published: Oxford University Press 2012
Subjects: Computational Biology
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3467046/
https://www.ncbi.nlm.nih.gov/pubmed/22798493
http://dx.doi.org/10.1093/nar/gks642

Description: A novel ab initio parameter-tuning-free system to identify transcriptional factor (TF) binding motifs (TFBMs) in genome DNA sequences was developed. It is based on the comparison of two types of frequency distributions with respect to the TFBM candidates in the target DNA sequences and the non-candidates in the background sequence, with the latter generated by utilizing the intergenic sequences. For benchmark tests, we used DNA sequence datasets extracted by ChIP-on-chip and ChIP-seq techniques and identified 65 yeast and four mammalian TFBMs, with the latter including gaps. The accuracy of our system was compared with those of other available programs (i.e. MEME, Weeder, BioProspector, MDscan and DME) and was the best among them, even without tuning of the parameter set for each TFBM and pre-treatment/editing of the target DNA sequences. Moreover, with respect to some TFs for which the identified motifs are inconsistent with those in the references, our results were revealed to be correct, by comparing them with other existing experimental data. Thus, our identification system does not need any other biological information except for gene positions, and is also expected to be applicable to genome DNA sequences to identify unknown TFBMs as well as known ones.
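
The description mentions comparing frequency distributions in target sequences against a background built from intergenic sequences. As a rough illustration of that general idea only, the sketch below ranks k-mers by how much more frequent they are in a target set than in a background set. It is not the authors' system: the function names, the fixed k-mer length, the log-ratio score and the pseudocount are all assumptions made for this example.

```python
from collections import Counter
from math import log2


def kmer_frequencies(sequences, k):
    """Relative frequency of every k-mer observed in a list of DNA strings."""
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    total = sum(counts.values())
    if total == 0:
        return {}
    return {kmer: n / total for kmer, n in counts.items()}


def rank_enriched_kmers(target, background, k, pseudo=1e-6):
    """Rank target k-mers by log2(target frequency / background frequency).

    `pseudo` is a hypothetical pseudocount that keeps the ratio finite
    for k-mers that never occur in the background.
    """
    target_freq = kmer_frequencies(target, k)
    background_freq = kmer_frequencies(background, k)
    scores = {
        kmer: log2((freq + pseudo) / (background_freq.get(kmer, 0.0) + pseudo))
        for kmer, freq in target_freq.items()
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    # Toy sequences invented for this example; not data from the article.
    target = ["ACGTGACGTG", "TTACGTGA"]
    background = ["TTTTAAAATTTT", "AATTAATT"]
    for kmer, score in rank_enriched_kmers(target, background, k=5)[:3]:
        print(kmer, round(score, 2))
```

A real motif finder would also need to handle degenerate positions, gaps and both DNA strands, none of which this toy example attempts.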
Journal: Nucleic Acids Res
Dates: 2012-10; 2012-07-13
License: © The Author(s) 2012. Published by Oxford University Press. This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.