Cargando…

An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes

BACKGROUND: Phylogenetic footprinting is an important computational technique for identifying cis-regulatory motifs in orthologous regulatory regions from multiple genomes, as motifs tend to evolve slower than their surrounding non-functional sequences. Its application, however, has several difficul...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Bingqiang, Zhang, Hanyuan, Zhou, Chuan, Li, Guojun, Fennell, Anne, Wang, Guanghui, Kang, Yu, Liu, Qi, Ma, Qin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4977642/
https://www.ncbi.nlm.nih.gov/pubmed/27507169
http://dx.doi.org/10.1186/s12864-016-2982-x
_version_ 1782447067101134848
author Liu, Bingqiang
Zhang, Hanyuan
Zhou, Chuan
Li, Guojun
Fennell, Anne
Wang, Guanghui
Kang, Yu
Liu, Qi
Ma, Qin
author_facet Liu, Bingqiang
Zhang, Hanyuan
Zhou, Chuan
Li, Guojun
Fennell, Anne
Wang, Guanghui
Kang, Yu
Liu, Qi
Ma, Qin
author_sort Liu, Bingqiang
collection PubMed
description BACKGROUND: Phylogenetic footprinting is an important computational technique for identifying cis-regulatory motifs in orthologous regulatory regions from multiple genomes, as motifs tend to evolve slower than their surrounding non-functional sequences. Its application, however, has several difficulties for optimizing the selection of orthologous data and reducing the false positives in motif prediction. RESULTS: Here we present an integrative phylogenetic footprinting framework for accurate motif predictions in prokaryotic genomes (MP(3)). The framework includes a new orthologous data preparation procedure, an additional promoter scoring and pruning method and an integration of six existing motif finding algorithms as basic motif search engines. Specifically, we collected orthologous genes from available prokaryotic genomes and built the orthologous regulatory regions based on sequence similarity of promoter regions. This procedure made full use of the large-scale genomic data and taxonomy information and filtered out the promoters with limited contribution to produce a high quality orthologous promoter set. The promoter scoring and pruning is implemented through motif voting by a set of complementary predicting tools that mine as many motif candidates as possible and simultaneously eliminate the effect of random noise. We have applied the framework to Escherichia coli k12 genome and evaluated the prediction performance through comparison with seven existing programs. This evaluation was systematically carried out at the nucleotide and binding site level, and the results showed that MP(3) consistently outperformed other popular motif finding tools. We have integrated MP(3) into our motif identification and analysis server DMINDA, allowing users to efficiently identify and analyze motifs in 2,072 completely sequenced prokaryotic genomes. CONCLUSION: The performance evaluation indicated that MP(3) is effective for predicting regulatory motifs in prokaryotic genomes. Its application may enhance progress in elucidating transcription regulation mechanism, thus provide benefit to the genomic research community and prokaryotic genome researchers in particular. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2982-x) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4977642
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49776422016-08-10 An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes Liu, Bingqiang Zhang, Hanyuan Zhou, Chuan Li, Guojun Fennell, Anne Wang, Guanghui Kang, Yu Liu, Qi Ma, Qin BMC Genomics Methodology Article BACKGROUND: Phylogenetic footprinting is an important computational technique for identifying cis-regulatory motifs in orthologous regulatory regions from multiple genomes, as motifs tend to evolve slower than their surrounding non-functional sequences. Its application, however, has several difficulties for optimizing the selection of orthologous data and reducing the false positives in motif prediction. RESULTS: Here we present an integrative phylogenetic footprinting framework for accurate motif predictions in prokaryotic genomes (MP(3)). The framework includes a new orthologous data preparation procedure, an additional promoter scoring and pruning method and an integration of six existing motif finding algorithms as basic motif search engines. Specifically, we collected orthologous genes from available prokaryotic genomes and built the orthologous regulatory regions based on sequence similarity of promoter regions. This procedure made full use of the large-scale genomic data and taxonomy information and filtered out the promoters with limited contribution to produce a high quality orthologous promoter set. The promoter scoring and pruning is implemented through motif voting by a set of complementary predicting tools that mine as many motif candidates as possible and simultaneously eliminate the effect of random noise. We have applied the framework to Escherichia coli k12 genome and evaluated the prediction performance through comparison with seven existing programs. This evaluation was systematically carried out at the nucleotide and binding site level, and the results showed that MP(3) consistently outperformed other popular motif finding tools. We have integrated MP(3) into our motif identification and analysis server DMINDA, allowing users to efficiently identify and analyze motifs in 2,072 completely sequenced prokaryotic genomes. CONCLUSION: The performance evaluation indicated that MP(3) is effective for predicting regulatory motifs in prokaryotic genomes. Its application may enhance progress in elucidating transcription regulation mechanism, thus provide benefit to the genomic research community and prokaryotic genome researchers in particular. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2982-x) contains supplementary material, which is available to authorized users. BioMed Central 2016-08-09 /pmc/articles/PMC4977642/ /pubmed/27507169 http://dx.doi.org/10.1186/s12864-016-2982-x Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Liu, Bingqiang
Zhang, Hanyuan
Zhou, Chuan
Li, Guojun
Fennell, Anne
Wang, Guanghui
Kang, Yu
Liu, Qi
Ma, Qin
An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes
title An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes
title_full An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes
title_fullStr An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes
title_full_unstemmed An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes
title_short An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes
title_sort integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4977642/
https://www.ncbi.nlm.nih.gov/pubmed/27507169
http://dx.doi.org/10.1186/s12864-016-2982-x
work_keys_str_mv AT liubingqiang anintegrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes
AT zhanghanyuan anintegrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes
AT zhouchuan anintegrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes
AT liguojun anintegrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes
AT fennellanne anintegrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes
AT wangguanghui anintegrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes
AT kangyu anintegrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes
AT liuqi anintegrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes
AT maqin anintegrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes
AT liubingqiang integrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes
AT zhanghanyuan integrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes
AT zhouchuan integrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes
AT liguojun integrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes
AT fennellanne integrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes
AT wangguanghui integrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes
AT kangyu integrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes
AT liuqi integrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes
AT maqin integrativeandapplicablephylogeneticfootprintingframeworkforcisregulatorymotifsidentificationinprokaryoticgenomes