Cargando…

A Semi-Quantitative, Synteny-Based Method to Improve Functional Predictions for Hypothetical and Poorly Annotated Bacterial and Archaeal Genes

During microbial evolution, genome rearrangement increases with increasing sequence divergence. If the relationship between synteny and sequence divergence can be modeled, gene clusters in genomes of distantly related organisms exhibiting anomalous synteny can be identified and used to infer functio...

Descripción completa

Detalles Bibliográficos
Autores principales: Yelton, Alexis P., Thomas, Brian C., Simmons, Sheri L., Wilmes, Paul, Zemla, Adam, Thelen, Michael P., Justice, Nicholas, Banfield, Jillian F.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3197636/
https://www.ncbi.nlm.nih.gov/pubmed/22028637
http://dx.doi.org/10.1371/journal.pcbi.1002230
_version_ 1782214344746991616
author Yelton, Alexis P.
Thomas, Brian C.
Simmons, Sheri L.
Wilmes, Paul
Zemla, Adam
Thelen, Michael P.
Justice, Nicholas
Banfield, Jillian F.
author_facet Yelton, Alexis P.
Thomas, Brian C.
Simmons, Sheri L.
Wilmes, Paul
Zemla, Adam
Thelen, Michael P.
Justice, Nicholas
Banfield, Jillian F.
author_sort Yelton, Alexis P.
collection PubMed
description During microbial evolution, genome rearrangement increases with increasing sequence divergence. If the relationship between synteny and sequence divergence can be modeled, gene clusters in genomes of distantly related organisms exhibiting anomalous synteny can be identified and used to infer functional conservation. We applied the phylogenetic pairwise comparison method to establish and model a strong correlation between synteny and sequence divergence in all 634 available Archaeal and Bacterial genomes from the NCBI database and four newly assembled genomes of uncultivated Archaea from an acid mine drainage (AMD) community. In parallel, we established and modeled the trend between synteny and functional relatedness in the 118 genomes available in the STRING database. By combining these models, we developed a gene functional annotation method that weights evolutionary distance to estimate the probability of functional associations of syntenous proteins between genome pairs. The method was applied to the hypothetical proteins and poorly annotated genes in newly assembled acid mine drainage Archaeal genomes to add or improve gene annotations. This is the first method to assign possible functions to poorly annotated genes through quantification of the probability of gene functional relationships based on synteny at a significant evolutionary distance, and has the potential for broad application.
format Online
Article
Text
id pubmed-3197636
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31976362011-10-25 A Semi-Quantitative, Synteny-Based Method to Improve Functional Predictions for Hypothetical and Poorly Annotated Bacterial and Archaeal Genes Yelton, Alexis P. Thomas, Brian C. Simmons, Sheri L. Wilmes, Paul Zemla, Adam Thelen, Michael P. Justice, Nicholas Banfield, Jillian F. PLoS Comput Biol Research Article During microbial evolution, genome rearrangement increases with increasing sequence divergence. If the relationship between synteny and sequence divergence can be modeled, gene clusters in genomes of distantly related organisms exhibiting anomalous synteny can be identified and used to infer functional conservation. We applied the phylogenetic pairwise comparison method to establish and model a strong correlation between synteny and sequence divergence in all 634 available Archaeal and Bacterial genomes from the NCBI database and four newly assembled genomes of uncultivated Archaea from an acid mine drainage (AMD) community. In parallel, we established and modeled the trend between synteny and functional relatedness in the 118 genomes available in the STRING database. By combining these models, we developed a gene functional annotation method that weights evolutionary distance to estimate the probability of functional associations of syntenous proteins between genome pairs. The method was applied to the hypothetical proteins and poorly annotated genes in newly assembled acid mine drainage Archaeal genomes to add or improve gene annotations. This is the first method to assign possible functions to poorly annotated genes through quantification of the probability of gene functional relationships based on synteny at a significant evolutionary distance, and has the potential for broad application. Public Library of Science 2011-10-20 /pmc/articles/PMC3197636/ /pubmed/22028637 http://dx.doi.org/10.1371/journal.pcbi.1002230 Text en Yelton et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Yelton, Alexis P.
Thomas, Brian C.
Simmons, Sheri L.
Wilmes, Paul
Zemla, Adam
Thelen, Michael P.
Justice, Nicholas
Banfield, Jillian F.
A Semi-Quantitative, Synteny-Based Method to Improve Functional Predictions for Hypothetical and Poorly Annotated Bacterial and Archaeal Genes
title A Semi-Quantitative, Synteny-Based Method to Improve Functional Predictions for Hypothetical and Poorly Annotated Bacterial and Archaeal Genes
title_full A Semi-Quantitative, Synteny-Based Method to Improve Functional Predictions for Hypothetical and Poorly Annotated Bacterial and Archaeal Genes
title_fullStr A Semi-Quantitative, Synteny-Based Method to Improve Functional Predictions for Hypothetical and Poorly Annotated Bacterial and Archaeal Genes
title_full_unstemmed A Semi-Quantitative, Synteny-Based Method to Improve Functional Predictions for Hypothetical and Poorly Annotated Bacterial and Archaeal Genes
title_short A Semi-Quantitative, Synteny-Based Method to Improve Functional Predictions for Hypothetical and Poorly Annotated Bacterial and Archaeal Genes
title_sort semi-quantitative, synteny-based method to improve functional predictions for hypothetical and poorly annotated bacterial and archaeal genes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3197636/
https://www.ncbi.nlm.nih.gov/pubmed/22028637
http://dx.doi.org/10.1371/journal.pcbi.1002230
work_keys_str_mv AT yeltonalexisp asemiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes
AT thomasbrianc asemiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes
AT simmonssheril asemiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes
AT wilmespaul asemiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes
AT zemlaadam asemiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes
AT thelenmichaelp asemiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes
AT justicenicholas asemiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes
AT banfieldjillianf asemiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes
AT yeltonalexisp semiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes
AT thomasbrianc semiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes
AT simmonssheril semiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes
AT wilmespaul semiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes
AT zemlaadam semiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes
AT thelenmichaelp semiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes
AT justicenicholas semiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes
AT banfieldjillianf semiquantitativesyntenybasedmethodtoimprovefunctionalpredictionsforhypotheticalandpoorlyannotatedbacterialandarchaealgenes