RhoTermPredict: an algorithm for predicting Rho-dependent transcription terminators based on Escherichia coli, Bacillus subtilis and Salmonella enterica databases

Bibliographic Details
Main authors: Di Salvo, Marco, Puccio, Simone, Peano, Clelia, Lacour, Stephan, Alifano, Pietro
Format: Online Article Text
Language: English
Published: BioMed Central 2019
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6407284/
https://www.ncbi.nlm.nih.gov/pubmed/30845912
http://dx.doi.org/10.1186/s12859-019-2704-x
author Di Salvo, Marco
Puccio, Simone
Peano, Clelia
Lacour, Stephan
Alifano, Pietro
author_sort Di Salvo, Marco
collection PubMed
description BACKGROUND: Bacterial genomes use two mechanisms of transcription termination: “intrinsic” or Rho-independent termination, and Rho-dependent termination. Intrinsic terminators are characterized by an RNA hairpin followed by a run of 6–8 U residues and are relatively easy to identify with one of the many available prediction programs. In contrast, Rho-dependent termination is mediated by the Rho protein factor, which first binds ribosome-free mRNA at a site with C > G content (more C than G residues) and then reaches the RNA polymerase to induce its release. Unlike the prediction of intrinsic terminators, the computational prediction of Rho-dependent terminators in prokaryotes is very difficult, because the sequence features required for Rho function are complex and poorly defined. For this reason, no exhaustive prediction program for Rho-dependent terminators exists yet.
RESULTS: In this study we introduce RhoTermPredict, the first published algorithm for exhaustive prediction of Rho-dependent terminators in bacterial genomes. RhoTermPredict identifies these elements using a previously proposed consensus motif common to all Rho-dependent transcription terminators. In essence, it searches for a 78-nt-long RUT site with C > G content and regularly spaced C residues, followed by a putative pause site for the RNA polymerase. We tested the performance of RhoTermPredict on available genomic and transcriptomic data from Escherichia coli K-12, both on limited-length sequences and on the whole genome, and on available genomic sequences from Bacillus subtilis 168 and Salmonella enterica LT2. We also estimated the overlap between the predictions of RhoTermPredict and those of ARNold, a web tool that predicts intrinsic terminators. Our results show that RhoTermPredict performs well both on limited-length sequences (F1-score of about 0.7) and in genome-wide analysis. Furthermore, the overlap with ARNold predictions was very low.
CONCLUSIONS: Our analysis shows that RhoTermPredict is a powerful tool for finding Rho-dependent terminators in the three analyzed genomes and could fill this gap in computational genomics. We conclude that RhoTermPredict could be combined with a predictor of intrinsic terminators in order to predict all transcription terminators in bacterial genomes.
ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2704-x) contains supplementary material, which is available to authorized users.
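The window search described in the abstract can be illustrated with a minimal Python sketch. It is not the published RhoTermPredict code: it only scans fixed 78-nt windows and keeps those with more C than G residues. The function names, the `min_ratio` threshold, and the handling of windows without G are assumptions made for illustration; the regular spacing of C residues and the downstream pause site are not modeled.

```python
WINDOW = 78  # RUT window length stated in the abstract


def cg_ratio(window: str) -> float:
    """Return count(C) / count(G) for an uppercase sequence window.

    Assumption for this sketch: a window with C but no G gets an
    infinite ratio, and a window with neither C nor G gets 0.0.
    """
    c = window.count("C")
    g = window.count("G")
    if g == 0:
        return float("inf") if c > 0 else 0.0
    return c / g


def candidate_rut_windows(seq: str, min_ratio: float = 1.0, window: int = WINDOW):
    """Yield (start, ratio) for each window whose C/G ratio exceeds min_ratio.

    min_ratio is a hypothetical parameter; 1.0 encodes the plain
    "C > G" condition from the abstract.
    """
    seq = seq.upper()
    # range() is empty when the sequence is shorter than one window.
    for start in range(len(seq) - window + 1):
        ratio = cg_ratio(seq[start:start + window])
        if ratio > min_ratio:
            yield start, ratio
```

Calling `list(candidate_rut_windows(sequence))` on a DNA or RNA string returns the zero-based start positions of the candidate windows together with their C/G ratios; a real predictor would apply further filters to these candidates.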
format Online
Article
Text
id pubmed-6407284
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-6407284 2019-03-21
BMC Bioinformatics
Software
BioMed Central 2019-03-07
/pmc/articles/PMC6407284/ /pubmed/30845912 http://dx.doi.org/10.1186/s12859-019-2704-x
Text en
© The Author(s). 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title RhoTermPredict: an algorithm for predicting Rho-dependent transcription terminators based on Escherichia coli, Bacillus subtilis and Salmonella enterica databases
topic Software