Cargando…
A novel ab initio identification system of transcriptional regulation motifs in genome DNA sequences based on direct comparison scheme of signal/noise distributions
A novel ab initio parameter-tuning-free system to identify transcriptional factor (TF) binding motifs (TFBMs) in genome DNA sequences was developed. It is based on the comparison of two types of frequency distributions with respect to the TFBM candidates in the target DNA sequences and the non-candi...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3467046/ https://www.ncbi.nlm.nih.gov/pubmed/22798493 http://dx.doi.org/10.1093/nar/gks642 |
_version_ | 1782245734271156224 |
---|---|
author | Nakaki, Ryo Kang, Jiyoung Tateno, Masaru |
author_facet | Nakaki, Ryo Kang, Jiyoung Tateno, Masaru |
author_sort | Nakaki, Ryo |
collection | PubMed |
description | A novel ab initio parameter-tuning-free system to identify transcriptional factor (TF) binding motifs (TFBMs) in genome DNA sequences was developed. It is based on the comparison of two types of frequency distributions with respect to the TFBM candidates in the target DNA sequences and the non-candidates in the background sequence, with the latter generated by utilizing the intergenic sequences. For benchmark tests, we used DNA sequence datasets extracted by ChIP-on-chip and ChIP-seq techniques and identified 65 yeast and four mammalian TFBMs, with the latter including gaps. The accuracy of our system was compared with those of other available programs (i.e. MEME, Weeder, BioProspector, MDscan and DME) and was the best among them, even without tuning of the parameter set for each TFBM and pre-treatment/editing of the target DNA sequences. Moreover, with respect to some TFs for which the identified motifs are inconsistent with those in the references, our results were revealed to be correct, by comparing them with other existing experimental data. Thus, our identification system does not need any other biological information except for gene positions, and is also expected to be applicable to genome DNA sequences to identify unknown TFBMs as well as known ones. |
format | Online Article Text |
id | pubmed-3467046 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-34670462012-10-10 A novel ab initio identification system of transcriptional regulation motifs in genome DNA sequences based on direct comparison scheme of signal/noise distributions Nakaki, Ryo Kang, Jiyoung Tateno, Masaru Nucleic Acids Res Computational Biology A novel ab initio parameter-tuning-free system to identify transcriptional factor (TF) binding motifs (TFBMs) in genome DNA sequences was developed. It is based on the comparison of two types of frequency distributions with respect to the TFBM candidates in the target DNA sequences and the non-candidates in the background sequence, with the latter generated by utilizing the intergenic sequences. For benchmark tests, we used DNA sequence datasets extracted by ChIP-on-chip and ChIP-seq techniques and identified 65 yeast and four mammalian TFBMs, with the latter including gaps. The accuracy of our system was compared with those of other available programs (i.e. MEME, Weeder, BioProspector, MDscan and DME) and was the best among them, even without tuning of the parameter set for each TFBM and pre-treatment/editing of the target DNA sequences. Moreover, with respect to some TFs for which the identified motifs are inconsistent with those in the references, our results were revealed to be correct, by comparing them with other existing experimental data. Thus, our identification system does not need any other biological information except for gene positions, and is also expected to be applicable to genome DNA sequences to identify unknown TFBMs as well as known ones. Oxford University Press 2012-10 2012-07-13 /pmc/articles/PMC3467046/ /pubmed/22798493 http://dx.doi.org/10.1093/nar/gks642 Text en © The Author(s) 2012. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/3.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Computational Biology Nakaki, Ryo Kang, Jiyoung Tateno, Masaru A novel ab initio identification system of transcriptional regulation motifs in genome DNA sequences based on direct comparison scheme of signal/noise distributions |
title | A novel ab initio identification system of transcriptional regulation motifs in genome DNA sequences based on direct comparison scheme of signal/noise distributions |
title_full | A novel ab initio identification system of transcriptional regulation motifs in genome DNA sequences based on direct comparison scheme of signal/noise distributions |
title_fullStr | A novel ab initio identification system of transcriptional regulation motifs in genome DNA sequences based on direct comparison scheme of signal/noise distributions |
title_full_unstemmed | A novel ab initio identification system of transcriptional regulation motifs in genome DNA sequences based on direct comparison scheme of signal/noise distributions |
title_short | A novel ab initio identification system of transcriptional regulation motifs in genome DNA sequences based on direct comparison scheme of signal/noise distributions |
title_sort | novel ab initio identification system of transcriptional regulation motifs in genome dna sequences based on direct comparison scheme of signal/noise distributions |
topic | Computational Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3467046/ https://www.ncbi.nlm.nih.gov/pubmed/22798493 http://dx.doi.org/10.1093/nar/gks642 |
work_keys_str_mv | AT nakakiryo anovelabinitioidentificationsystemoftranscriptionalregulationmotifsingenomednasequencesbasedondirectcomparisonschemeofsignalnoisedistributions AT kangjiyoung anovelabinitioidentificationsystemoftranscriptionalregulationmotifsingenomednasequencesbasedondirectcomparisonschemeofsignalnoisedistributions AT tatenomasaru anovelabinitioidentificationsystemoftranscriptionalregulationmotifsingenomednasequencesbasedondirectcomparisonschemeofsignalnoisedistributions AT nakakiryo novelabinitioidentificationsystemoftranscriptionalregulationmotifsingenomednasequencesbasedondirectcomparisonschemeofsignalnoisedistributions AT kangjiyoung novelabinitioidentificationsystemoftranscriptionalregulationmotifsingenomednasequencesbasedondirectcomparisonschemeofsignalnoisedistributions AT tatenomasaru novelabinitioidentificationsystemoftranscriptionalregulationmotifsingenomednasequencesbasedondirectcomparisonschemeofsignalnoisedistributions |