Cargando…

Distinguishing between productive and abortive promoters using a random forest classifier in Mycoplasma pneumoniae

Distinguishing between promoter-like sequences in bacteria that belong to true or abortive promoters, or to those that do not initiate transcription at all, is one of the important challenges in transcriptomics. To address this problem, we have studied the genome-reduced bacterium Mycoplasma pneumon...

Descripción completa

Detalles Bibliográficos
Autores principales: Lloréns-Rico, Verónica, Lluch-Senar, Maria, Serrano, Luis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4402517/
https://www.ncbi.nlm.nih.gov/pubmed/25779052
http://dx.doi.org/10.1093/nar/gkv170
_version_ 1782367267818831872
author Lloréns-Rico, Verónica
Lluch-Senar, Maria
Serrano, Luis
author_facet Lloréns-Rico, Verónica
Lluch-Senar, Maria
Serrano, Luis
author_sort Lloréns-Rico, Verónica
collection PubMed
description Distinguishing between promoter-like sequences in bacteria that belong to true or abortive promoters, or to those that do not initiate transcription at all, is one of the important challenges in transcriptomics. To address this problem, we have studied the genome-reduced bacterium Mycoplasma pneumoniae, for which the RNAs associated with transcriptional start sites have been recently experimentally identified. We determined the contribution to transcription events of different genomic features: the –10, extended –10 and –35 boxes, the UP element, the bases surrounding the –10 box and the nearest-neighbor free energy of the promoter region. Using a random forest classifier and the aforementioned features transformed into scores, we could distinguish between true, abortive promoters and non-promoters with good –10 box sequences. The methods used in this characterization of promoters can be extended to other bacteria and have important applications for promoter design in bacterial genome engineering.
format Online
Article
Text
id pubmed-4402517
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-44025172015-04-29 Distinguishing between productive and abortive promoters using a random forest classifier in Mycoplasma pneumoniae Lloréns-Rico, Verónica Lluch-Senar, Maria Serrano, Luis Nucleic Acids Res Computational Biology Distinguishing between promoter-like sequences in bacteria that belong to true or abortive promoters, or to those that do not initiate transcription at all, is one of the important challenges in transcriptomics. To address this problem, we have studied the genome-reduced bacterium Mycoplasma pneumoniae, for which the RNAs associated with transcriptional start sites have been recently experimentally identified. We determined the contribution to transcription events of different genomic features: the –10, extended –10 and –35 boxes, the UP element, the bases surrounding the –10 box and the nearest-neighbor free energy of the promoter region. Using a random forest classifier and the aforementioned features transformed into scores, we could distinguish between true, abortive promoters and non-promoters with good –10 box sequences. The methods used in this characterization of promoters can be extended to other bacteria and have important applications for promoter design in bacterial genome engineering. Oxford University Press 2015-04-20 2015-03-16 /pmc/articles/PMC4402517/ /pubmed/25779052 http://dx.doi.org/10.1093/nar/gkv170 Text en © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research. http://creativecommons.org/licenses/by-nc/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Computational Biology
Lloréns-Rico, Verónica
Lluch-Senar, Maria
Serrano, Luis
Distinguishing between productive and abortive promoters using a random forest classifier in Mycoplasma pneumoniae
title Distinguishing between productive and abortive promoters using a random forest classifier in Mycoplasma pneumoniae
title_full Distinguishing between productive and abortive promoters using a random forest classifier in Mycoplasma pneumoniae
title_fullStr Distinguishing between productive and abortive promoters using a random forest classifier in Mycoplasma pneumoniae
title_full_unstemmed Distinguishing between productive and abortive promoters using a random forest classifier in Mycoplasma pneumoniae
title_short Distinguishing between productive and abortive promoters using a random forest classifier in Mycoplasma pneumoniae
title_sort distinguishing between productive and abortive promoters using a random forest classifier in mycoplasma pneumoniae
topic Computational Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4402517/
https://www.ncbi.nlm.nih.gov/pubmed/25779052
http://dx.doi.org/10.1093/nar/gkv170
work_keys_str_mv AT llorensricoveronica distinguishingbetweenproductiveandabortivepromotersusingarandomforestclassifierinmycoplasmapneumoniae
AT lluchsenarmaria distinguishingbetweenproductiveandabortivepromotersusingarandomforestclassifierinmycoplasmapneumoniae
AT serranoluis distinguishingbetweenproductiveandabortivepromotersusingarandomforestclassifierinmycoplasmapneumoniae