Cargando…

TSSPlant: a new tool for prediction of plant Pol II promoters

Our current knowledge of eukaryotic promoters indicates their complex architecture that is often composed of numerous functional motifs. Most of known promoters include multiple and in some cases mutually exclusive transcription start sites (TSSs). Moreover, TSS selection depends on cell/tissue, dev...

Descripción completa

Detalles Bibliográficos
Autores principales: Shahmuradov, Ilham A., Umarov, Ramzan Kh., Solovyev, Victor V.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5416875/
https://www.ncbi.nlm.nih.gov/pubmed/28082394
http://dx.doi.org/10.1093/nar/gkw1353
_version_ 1783233835183374336
author Shahmuradov, Ilham A.
Umarov, Ramzan Kh.
Solovyev, Victor V.
author_facet Shahmuradov, Ilham A.
Umarov, Ramzan Kh.
Solovyev, Victor V.
author_sort Shahmuradov, Ilham A.
collection PubMed
description Our current knowledge of eukaryotic promoters indicates their complex architecture that is often composed of numerous functional motifs. Most of known promoters include multiple and in some cases mutually exclusive transcription start sites (TSSs). Moreover, TSS selection depends on cell/tissue, development stage and environmental conditions. Such complex promoter structures make their computational identification notoriously difficult. Here, we present TSSPlant, a novel tool that predicts both TATA and TATA-less promoters in sequences of a wide spectrum of plant genomes. The tool was developed by using large promoter collections from ppdb and PlantProm DB. It utilizes eighteen significant compositional and signal features of plant promoter sequences selected in this study, that feed the artificial neural network-based model trained by the backpropagation algorithm. TSSPlant achieves significantly higher accuracy compared to the next best promoter prediction program for both TATA promoters (MCC≃0.84 and F1-score≃0.91 versus MCC≃0.51 and F1-score≃0.71) and TATA-less promoters (MCC≃0.80, F1-score≃0.89 versus MCC≃0.29 and F1-score≃0.50). TSSPlant is available to download as a standalone program at http://www.cbrc.kaust.edu.sa/download/.
format Online
Article
Text
id pubmed-5416875
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-54168752017-05-05 TSSPlant: a new tool for prediction of plant Pol II promoters Shahmuradov, Ilham A. Umarov, Ramzan Kh. Solovyev, Victor V. Nucleic Acids Res Methods Online Our current knowledge of eukaryotic promoters indicates their complex architecture that is often composed of numerous functional motifs. Most of known promoters include multiple and in some cases mutually exclusive transcription start sites (TSSs). Moreover, TSS selection depends on cell/tissue, development stage and environmental conditions. Such complex promoter structures make their computational identification notoriously difficult. Here, we present TSSPlant, a novel tool that predicts both TATA and TATA-less promoters in sequences of a wide spectrum of plant genomes. The tool was developed by using large promoter collections from ppdb and PlantProm DB. It utilizes eighteen significant compositional and signal features of plant promoter sequences selected in this study, that feed the artificial neural network-based model trained by the backpropagation algorithm. TSSPlant achieves significantly higher accuracy compared to the next best promoter prediction program for both TATA promoters (MCC≃0.84 and F1-score≃0.91 versus MCC≃0.51 and F1-score≃0.71) and TATA-less promoters (MCC≃0.80, F1-score≃0.89 versus MCC≃0.29 and F1-score≃0.50). TSSPlant is available to download as a standalone program at http://www.cbrc.kaust.edu.sa/download/. Oxford University Press 2017-05-05 2017-01-13 /pmc/articles/PMC5416875/ /pubmed/28082394 http://dx.doi.org/10.1093/nar/gkw1353 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Methods Online
Shahmuradov, Ilham A.
Umarov, Ramzan Kh.
Solovyev, Victor V.
TSSPlant: a new tool for prediction of plant Pol II promoters
title TSSPlant: a new tool for prediction of plant Pol II promoters
title_full TSSPlant: a new tool for prediction of plant Pol II promoters
title_fullStr TSSPlant: a new tool for prediction of plant Pol II promoters
title_full_unstemmed TSSPlant: a new tool for prediction of plant Pol II promoters
title_short TSSPlant: a new tool for prediction of plant Pol II promoters
title_sort tssplant: a new tool for prediction of plant pol ii promoters
topic Methods Online
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5416875/
https://www.ncbi.nlm.nih.gov/pubmed/28082394
http://dx.doi.org/10.1093/nar/gkw1353
work_keys_str_mv AT shahmuradovilhama tssplantanewtoolforpredictionofplantpoliipromoters
AT umarovramzankh tssplantanewtoolforpredictionofplantpoliipromoters
AT solovyevvictorv tssplantanewtoolforpredictionofplantpoliipromoters