Cargando…
A Semi-Quantitative, Synteny-Based Method to Improve Functional Predictions for Hypothetical and Poorly Annotated Bacterial and Archaeal Genes
During microbial evolution, genome rearrangement increases with increasing sequence divergence. If the relationship between synteny and sequence divergence can be modeled, gene clusters in genomes of distantly related organisms exhibiting anomalous synteny can be identified and used to infer functio...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3197636/ https://www.ncbi.nlm.nih.gov/pubmed/22028637 http://dx.doi.org/10.1371/journal.pcbi.1002230 |
_version_ | 1782214344746991616 |
---|---|
author | Yelton, Alexis P. Thomas, Brian C. Simmons, Sheri L. Wilmes, Paul Zemla, Adam Thelen, Michael P. Justice, Nicholas Banfield, Jillian F. |
author_facet | Yelton, Alexis P. Thomas, Brian C. Simmons, Sheri L. Wilmes, Paul Zemla, Adam Thelen, Michael P. Justice, Nicholas Banfield, Jillian F. |
author_sort | Yelton, Alexis P. |
collection | PubMed |
description | During microbial evolution, genome rearrangement increases with increasing sequence divergence. If the relationship between synteny and sequence divergence can be modeled, gene clusters in genomes of distantly related organisms exhibiting anomalous synteny can be identified and used to infer functional conservation. We applied the phylogenetic pairwise comparison method to establish and model a strong correlation between synteny and sequence divergence in all 634 available Archaeal and Bacterial genomes from the NCBI database and four newly assembled genomes of uncultivated Archaea from an acid mine drainage (AMD) community. In parallel, we established and modeled the trend between synteny and functional relatedness in the 118 genomes available in the STRING database. By combining these models, we developed a gene functional annotation method that weights evolutionary distance to estimate the probability of functional associations of syntenous proteins between genome pairs. The method was applied to the hypothetical proteins and poorly annotated genes in newly assembled acid mine drainage Archaeal genomes to add or improve gene annotations. This is the first method to assign possible functions to poorly annotated genes through quantification of the probability of gene functional relationships based on synteny at a significant evolutionary distance, and has the potential for broad application. |
format | Online Article Text |
id | pubmed-3197636 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-31976362011-10-25 A Semi-Quantitative, Synteny-Based Method to Improve Functional Predictions for Hypothetical and Poorly Annotated Bacterial and Archaeal Genes Yelton, Alexis P. Thomas, Brian C. Simmons, Sheri L. Wilmes, Paul Zemla, Adam Thelen, Michael P. Justice, Nicholas Banfield, Jillian F. PLoS Comput Biol Research Article During microbial evolution, genome rearrangement increases with increasing sequence divergence. If the relationship between synteny and sequence divergence can be modeled, gene clusters in genomes of distantly related organisms exhibiting anomalous synteny can be identified and used to infer functional conservation. We applied the phylogenetic pairwise comparison method to establish and model a strong correlation between synteny and sequence divergence in all 634 available Archaeal and Bacterial genomes from the NCBI database and four newly assembled genomes of uncultivated Archaea from an acid mine drainage (AMD) community. In parallel, we established and modeled the trend between synteny and functional relatedness in the 118 genomes available in the STRING database. By combining these models, we developed a gene functional annotation method that weights evolutionary distance to estimate the probability of functional associations of syntenous proteins between genome pairs. The method was applied to the hypothetical proteins and poorly annotated genes in newly assembled acid mine drainage Archaeal genomes to add or improve gene annotations. This is the first method to assign possible functions to poorly annotated genes through quantification of the probability of gene functional relationships based on synteny at a significant evolutionary distance, and has the potential for broad application. Public Library of Science 2011-10-20 /pmc/articles/PMC3197636/ /pubmed/22028637 http://dx.doi.org/10.1371/journal.pcbi.1002230 Text en Yelton et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Yelton, Alexis P. Thomas, Brian C. Simmons, Sheri L. Wilmes, Paul Zemla, Adam Thelen, Michael P. Justice, Nicholas Banfield, Jillian F. A Semi-Quantitative, Synteny-Based Method to Improve Functional Predictions for Hypothetical and Poorly Annotated Bacterial and Archaeal Genes |
title | A Semi-Quantitative, Synteny-Based Method to Improve Functional Predictions for Hypothetical and Poorly Annotated Bacterial and Archaeal Genes |
title_full | A Semi-Quantitative, Synteny-Based Method to Improve Functional Predictions for Hypothetical and Poorly Annotated Bacterial and Archaeal Genes |
title_fullStr | A Semi-Quantitative, Synteny-Based Method to Improve Functional Predictions for Hypothetical and Poorly Annotated Bacterial and Archaeal Genes |
title_full_unstemmed | A Semi-Quantitative, Synteny-Based Method to Improve Functional Predictions for Hypothetical and Poorly Annotated Bacterial and Archaeal Genes |
title_short | A Semi-Quantitative, Synteny-Based Method to Improve Functional Predictions for Hypothetical and Poorly Annotated Bacterial and Archaeal Genes |
title_sort | semi-quantitative, synteny-based method to improve functional predictions for hypothetical and poorly annotated bacterial and archaeal genes |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3197636/ https://www.ncbi.nlm.nih.gov/pubmed/22028637 http://dx.doi.org/10.1371/journal.pcbi.1002230 |
work_keys_str_mv | AT yeltonalexisp asemiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes AT thomasbrianc asemiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes AT simmonssheril asemiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes AT wilmespaul asemiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes AT zemlaadam asemiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes AT thelenmichaelp asemiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes AT justicenicholas asemiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes AT banfieldjillianf asemiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes AT yeltonalexisp semiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes AT thomasbrianc semiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes AT simmonssheril semiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes AT wilmespaul semiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes AT zemlaadam semiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes AT thelenmichaelp semiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes AT justicenicholas semiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes AT banfieldjillianf semiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes |