Cargando…
TSSPlant: a new tool for prediction of plant Pol II promoters
Our current knowledge of eukaryotic promoters indicates their complex architecture that is often composed of numerous functional motifs. Most of known promoters include multiple and in some cases mutually exclusive transcription start sites (TSSs). Moreover, TSS selection depends on cell/tissue, dev...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5416875/ https://www.ncbi.nlm.nih.gov/pubmed/28082394 http://dx.doi.org/10.1093/nar/gkw1353 |
_version_ | 1783233835183374336 |
---|---|
author | Shahmuradov, Ilham A. Umarov, Ramzan Kh. Solovyev, Victor V. |
author_facet | Shahmuradov, Ilham A. Umarov, Ramzan Kh. Solovyev, Victor V. |
author_sort | Shahmuradov, Ilham A. |
collection | PubMed |
description | Our current knowledge of eukaryotic promoters indicates their complex architecture that is often composed of numerous functional motifs. Most of known promoters include multiple and in some cases mutually exclusive transcription start sites (TSSs). Moreover, TSS selection depends on cell/tissue, development stage and environmental conditions. Such complex promoter structures make their computational identification notoriously difficult. Here, we present TSSPlant, a novel tool that predicts both TATA and TATA-less promoters in sequences of a wide spectrum of plant genomes. The tool was developed by using large promoter collections from ppdb and PlantProm DB. It utilizes eighteen significant compositional and signal features of plant promoter sequences selected in this study, that feed the artificial neural network-based model trained by the backpropagation algorithm. TSSPlant achieves significantly higher accuracy compared to the next best promoter prediction program for both TATA promoters (MCC≃0.84 and F1-score≃0.91 versus MCC≃0.51 and F1-score≃0.71) and TATA-less promoters (MCC≃0.80, F1-score≃0.89 versus MCC≃0.29 and F1-score≃0.50). TSSPlant is available to download as a standalone program at http://www.cbrc.kaust.edu.sa/download/. |
format | Online Article Text |
id | pubmed-5416875 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-54168752017-05-05 TSSPlant: a new tool for prediction of plant Pol II promoters Shahmuradov, Ilham A. Umarov, Ramzan Kh. Solovyev, Victor V. Nucleic Acids Res Methods Online Our current knowledge of eukaryotic promoters indicates their complex architecture that is often composed of numerous functional motifs. Most of known promoters include multiple and in some cases mutually exclusive transcription start sites (TSSs). Moreover, TSS selection depends on cell/tissue, development stage and environmental conditions. Such complex promoter structures make their computational identification notoriously difficult. Here, we present TSSPlant, a novel tool that predicts both TATA and TATA-less promoters in sequences of a wide spectrum of plant genomes. The tool was developed by using large promoter collections from ppdb and PlantProm DB. It utilizes eighteen significant compositional and signal features of plant promoter sequences selected in this study, that feed the artificial neural network-based model trained by the backpropagation algorithm. TSSPlant achieves significantly higher accuracy compared to the next best promoter prediction program for both TATA promoters (MCC≃0.84 and F1-score≃0.91 versus MCC≃0.51 and F1-score≃0.71) and TATA-less promoters (MCC≃0.80, F1-score≃0.89 versus MCC≃0.29 and F1-score≃0.50). TSSPlant is available to download as a standalone program at http://www.cbrc.kaust.edu.sa/download/. Oxford University Press 2017-05-05 2017-01-13 /pmc/articles/PMC5416875/ /pubmed/28082394 http://dx.doi.org/10.1093/nar/gkw1353 Text en © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Methods Online Shahmuradov, Ilham A. Umarov, Ramzan Kh. Solovyev, Victor V. TSSPlant: a new tool for prediction of plant Pol II promoters |
title | TSSPlant: a new tool for prediction of plant Pol II promoters |
title_full | TSSPlant: a new tool for prediction of plant Pol II promoters |
title_fullStr | TSSPlant: a new tool for prediction of plant Pol II promoters |
title_full_unstemmed | TSSPlant: a new tool for prediction of plant Pol II promoters |
title_short | TSSPlant: a new tool for prediction of plant Pol II promoters |
title_sort | tssplant: a new tool for prediction of plant pol ii promoters |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5416875/ https://www.ncbi.nlm.nih.gov/pubmed/28082394 http://dx.doi.org/10.1093/nar/gkw1353 |
work_keys_str_mv | AT shahmuradovilhama tssplantanewtoolforpredictionofplantpoliipromoters AT umarovramzankh tssplantanewtoolforpredictionofplantpoliipromoters AT solovyevvictorv tssplantanewtoolforpredictionofplantpoliipromoters |