RhoTermPredict: an algorithm for predicting Rho-dependent transcription terminators based on Escherichia coli, Bacillus subtilis and Salmonella enterica databases
BACKGROUND: In bacterial genomes, there are two mechanisms to terminate DNA transcription: the “intrinsic” or Rho-independent termination and the Rho-dependent termination. Intrinsic terminators are characterized by an RNA hairpin followed by a run of 6–8 U residues, relatively easy to identify us...
Main authors: | Di Salvo, Marco; Puccio, Simone; Peano, Clelia; Lacour, Stephan; Alifano, Pietro |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central 2019 |
Subjects: | Software |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6407284/ https://www.ncbi.nlm.nih.gov/pubmed/30845912 http://dx.doi.org/10.1186/s12859-019-2704-x |
_version_ | 1783401519036497920 |
---|---|
author | Di Salvo, Marco Puccio, Simone Peano, Clelia Lacour, Stephan Alifano, Pietro |
author_sort | Di Salvo, Marco |
collection | PubMed |
description | BACKGROUND: In bacterial genomes, there are two mechanisms to terminate DNA transcription: the “intrinsic” or Rho-independent termination and the Rho-dependent termination. Intrinsic terminators are characterized by an RNA hairpin followed by a run of 6–8 U residues and are relatively easy to identify using one of the numerous available prediction programs. In contrast, Rho-dependent termination is mediated by the Rho protein factor, which first binds to ribosome-free mRNA at a site characterized by a C > G content and then reaches the RNA polymerase to induce its release. Unlike for intrinsic terminators, the computational prediction of Rho-dependent terminators in prokaryotes is a very difficult problem because the sequence features required for the function of Rho are complex and poorly defined. For this reason, no exhaustive prediction program for Rho-dependent terminators exists yet. RESULTS: In this study we introduce RhoTermPredict, the first published algorithm for an exhaustive prediction of Rho-dependent terminators in bacterial genomes. RhoTermPredict identifies these elements based on a previously proposed consensus motif common to all Rho-dependent transcription terminators. It essentially searches for a 78 nt long RUT site characterized by a C > G content and with regularly spaced C residues, followed by a putative pause site for the RNA polymerase. We tested the performance of RhoTermPredict using available genomic and transcriptomic data of the microorganism Escherichia coli K-12, both in limited-length sequences and in the whole genome, and available genomic sequences from the Bacillus subtilis 168 and Salmonella enterica LT2 genomes. We also estimated the overlap between the predictions of RhoTermPredict and those obtained by the ARNold webtool, a predictor of intrinsic terminators. Our results demonstrated that RhoTermPredict performs well both on limited-length sequences (F1-score of about 0.7) and in a genome-wide analysis. Furthermore, the degree of overlap with ARNold predictions was very low. CONCLUSIONS: Our analysis shows that RhoTermPredict is a powerful tool for searching Rho-dependent terminators in the three analyzed genomes and could fill this gap in computational genomics. We conclude that RhoTermPredict could be used in combination with an intrinsic terminator predictor in order to predict all the transcription terminators in bacterial genomes. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12859-019-2704-x) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6407284 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-6407284 2019-03-21 RhoTermPredict: an algorithm for predicting Rho-dependent transcription terminators based on Escherichia coli, Bacillus subtilis and Salmonella enterica databases Di Salvo, Marco Puccio, Simone Peano, Clelia Lacour, Stephan Alifano, Pietro BMC Bioinformatics Software BioMed Central 2019-03-07 /pmc/articles/PMC6407284/ /pubmed/30845912 http://dx.doi.org/10.1186/s12859-019-2704-x Text en © The Author(s). 2019 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
title | RhoTermPredict: an algorithm for predicting Rho-dependent transcription terminators based on Escherichia coli, Bacillus subtilis and Salmonella enterica databases |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6407284/ https://www.ncbi.nlm.nih.gov/pubmed/30845912 http://dx.doi.org/10.1186/s12859-019-2704-x |
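
The description above states that RhoTermPredict searches for a 78 nt long RUT site characterized by a C > G content. The following Python sketch illustrates only that first criterion, as a sliding-window scan over a DNA sequence. It is not the authors' implementation: the function name `c_over_g_windows`, the one-nucleotide step, and the use of raw C and G counts are assumptions made for illustration, and the regular spacing of C residues and the downstream pause-site search mentioned in the abstract are not modeled here.

```python
from typing import Iterator, Tuple

WINDOW = 78  # RUT site length stated in the abstract


def c_over_g_windows(seq: str, window: int = WINDOW) -> Iterator[Tuple[int, int, int]]:
    """Yield (start, c_count, g_count) for each window with more C than G.

    Hypothetical helper: slides a fixed-size window one nucleotide at a
    time and updates the C and G counts incrementally.
    """
    seq = seq.upper()
    if len(seq) < window:
        return  # sequence too short to contain a full window
    c = seq[:window].count("C")
    g = seq[:window].count("G")
    start = 0
    while True:
        if c > g:
            yield start, c, g
        end = start + window
        if end >= len(seq):
            break  # the current window was the last one
        # Slide by one: drop seq[start], take in seq[end].
        out_base, in_base = seq[start], seq[end]
        c += (in_base == "C") - (out_base == "C")
        g += (in_base == "G") - (out_base == "G")
        start += 1


if __name__ == "__main__":
    # Toy sequence (not real genomic data): a G-rich half followed by a C-rich half.
    toy = "G" * 78 + "C" * 78
    hits = list(c_over_g_windows(toy))
    print(len(hits), hits[0])
```

Updating the counts incrementally keeps the scan linear in the sequence length, which matters for a genome-wide pass; a candidate window found this way would still need the further checks the abstract describes before being called a terminator.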