Cargando…
An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes
BACKGROUND: Phylogenetic footprinting is an important computational technique for identifying cis-regulatory motifs in orthologous regulatory regions from multiple genomes, as motifs tend to evolve slower than their surrounding non-functional sequences. Its application, however, has several difficul...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4977642/ https://www.ncbi.nlm.nih.gov/pubmed/27507169 http://dx.doi.org/10.1186/s12864-016-2982-x |
_version_ | 1782447067101134848 |
---|---|
author | Liu, Bingqiang Zhang, Hanyuan Zhou, Chuan Li, Guojun Fennell, Anne Wang, Guanghui Kang, Yu Liu, Qi Ma, Qin |
author_facet | Liu, Bingqiang Zhang, Hanyuan Zhou, Chuan Li, Guojun Fennell, Anne Wang, Guanghui Kang, Yu Liu, Qi Ma, Qin |
author_sort | Liu, Bingqiang |
collection | PubMed |
description | BACKGROUND: Phylogenetic footprinting is an important computational technique for identifying cis-regulatory motifs in orthologous regulatory regions from multiple genomes, as motifs tend to evolve slower than their surrounding non-functional sequences. Its application, however, has several difficulties for optimizing the selection of orthologous data and reducing the false positives in motif prediction. RESULTS: Here we present an integrative phylogenetic footprinting framework for accurate motif predictions in prokaryotic genomes (MP(3)). The framework includes a new orthologous data preparation procedure, an additional promoter scoring and pruning method and an integration of six existing motif finding algorithms as basic motif search engines. Specifically, we collected orthologous genes from available prokaryotic genomes and built the orthologous regulatory regions based on sequence similarity of promoter regions. This procedure made full use of the large-scale genomic data and taxonomy information and filtered out the promoters with limited contribution to produce a high quality orthologous promoter set. The promoter scoring and pruning is implemented through motif voting by a set of complementary predicting tools that mine as many motif candidates as possible and simultaneously eliminate the effect of random noise. We have applied the framework to Escherichia coli k12 genome and evaluated the prediction performance through comparison with seven existing programs. This evaluation was systematically carried out at the nucleotide and binding site level, and the results showed that MP(3) consistently outperformed other popular motif finding tools. We have integrated MP(3) into our motif identification and analysis server DMINDA, allowing users to efficiently identify and analyze motifs in 2,072 completely sequenced prokaryotic genomes. CONCLUSION: The performance evaluation indicated that MP(3) is effective for predicting regulatory motifs in prokaryotic genomes. Its application may enhance progress in elucidating transcription regulation mechanism, thus provide benefit to the genomic research community and prokaryotic genome researchers in particular. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2982-x) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4977642 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-49776422016-08-10 An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes Liu, Bingqiang Zhang, Hanyuan Zhou, Chuan Li, Guojun Fennell, Anne Wang, Guanghui Kang, Yu Liu, Qi Ma, Qin BMC Genomics Methodology Article BACKGROUND: Phylogenetic footprinting is an important computational technique for identifying cis-regulatory motifs in orthologous regulatory regions from multiple genomes, as motifs tend to evolve slower than their surrounding non-functional sequences. Its application, however, has several difficulties for optimizing the selection of orthologous data and reducing the false positives in motif prediction. RESULTS: Here we present an integrative phylogenetic footprinting framework for accurate motif predictions in prokaryotic genomes (MP(3)). The framework includes a new orthologous data preparation procedure, an additional promoter scoring and pruning method and an integration of six existing motif finding algorithms as basic motif search engines. Specifically, we collected orthologous genes from available prokaryotic genomes and built the orthologous regulatory regions based on sequence similarity of promoter regions. This procedure made full use of the large-scale genomic data and taxonomy information and filtered out the promoters with limited contribution to produce a high quality orthologous promoter set. The promoter scoring and pruning is implemented through motif voting by a set of complementary predicting tools that mine as many motif candidates as possible and simultaneously eliminate the effect of random noise. We have applied the framework to Escherichia coli k12 genome and evaluated the prediction performance through comparison with seven existing programs. This evaluation was systematically carried out at the nucleotide and binding site level, and the results showed that MP(3) consistently outperformed other popular motif finding tools. We have integrated MP(3) into our motif identification and analysis server DMINDA, allowing users to efficiently identify and analyze motifs in 2,072 completely sequenced prokaryotic genomes. CONCLUSION: The performance evaluation indicated that MP(3) is effective for predicting regulatory motifs in prokaryotic genomes. Its application may enhance progress in elucidating transcription regulation mechanism, thus provide benefit to the genomic research community and prokaryotic genome researchers in particular. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2982-x) contains supplementary material, which is available to authorized users. BioMed Central 2016-08-09 /pmc/articles/PMC4977642/ /pubmed/27507169 http://dx.doi.org/10.1186/s12864-016-2982-x Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Liu, Bingqiang Zhang, Hanyuan Zhou, Chuan Li, Guojun Fennell, Anne Wang, Guanghui Kang, Yu Liu, Qi Ma, Qin An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes |
title | An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes |
title_full | An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes |
title_fullStr | An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes |
title_full_unstemmed | An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes |
title_short | An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes |
title_sort | integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4977642/ https://www.ncbi.nlm.nih.gov/pubmed/27507169 http://dx.doi.org/10.1186/s12864-016-2982-x |
work_keys_str_mv | AT liubingqiang anintegrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes AT zhanghanyuan anintegrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes AT zhouchuan anintegrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes AT liguojun anintegrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes AT fennellanne anintegrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes AT wangguanghui anintegrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes AT kangyu anintegrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes AT liuqi anintegrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes AT maqin anintegrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes AT liubingqiang integrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes AT zhanghanyuan integrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes AT zhouchuan integrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes AT liguojun integrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes AT fennellanne integrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes AT wangguanghui integrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes AT kangyu integrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes AT liuqi integrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes AT maqin integrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes |