Cargando…
Positional Homology in Bacterial Genomes
In comparative genomic studies, syntenic groups of homologous sequence in the same order have been used as supplementary information that can be used in helping to determine the orthology of the compared sequences. The assumption is that ortholo-gous gene copies are more likely to share the same gen...
Autores principales: | , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Libertas Academica
2007
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2674667/ https://www.ncbi.nlm.nih.gov/pubmed/19455203 |
_version_ | 1782166662270681088 |
---|---|
author | Burgetz, Ingrid J. Shariff, Salimah Pang, Andy Tillier, Elisabeth R. M. |
author_facet | Burgetz, Ingrid J. Shariff, Salimah Pang, Andy Tillier, Elisabeth R. M. |
author_sort | Burgetz, Ingrid J. |
collection | PubMed |
description | In comparative genomic studies, syntenic groups of homologous sequence in the same order have been used as supplementary information that can be used in helping to determine the orthology of the compared sequences. The assumption is that ortholo-gous gene copies are more likely to share the same genome positions and share the same gene neighbors. In this study we have defined positional homologs as those that also have homologous neighboring genes and we investigated the usefulness of this distinction for bacterial comparative genomics. We considered the identification of positionaly homologous gene pairs in bacterial genomes using protein and DNA sequence level alignments and found that the positional homologs had on average relatively lower rates of substitution at the DNA level (synonymous substitutions) than duplicate homologs in different genomic locations, regardless of the level of protein sequence divergence (measured with non-synonymous substitution rate). Since gene order conservation can indicate accuracy of orthology assignments, we also considered the effect of imposing certain alignment quality requirements on the sensitivity and specificity of identification of protein pairs by BLAST and FASTA when neighboring information is not available and in comparisons where gene order is not conserved. We found that the addition of a stringency filter based on the second best hits was an efficient way to remove dubious ortholog identifications in BLAST and FASTA analyses. Gene order conservation and DNA sequence homology are useful to consider in comparative genomic studies as they may indicate different orthology assignments than protein sequence homology alone. |
format | Text |
id | pubmed-2674667 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2007 |
publisher | Libertas Academica |
record_format | MEDLINE/PubMed |
spelling | pubmed-26746672009-05-19 Positional Homology in Bacterial Genomes Burgetz, Ingrid J. Shariff, Salimah Pang, Andy Tillier, Elisabeth R. M. Evol Bioinform Online Original Research In comparative genomic studies, syntenic groups of homologous sequence in the same order have been used as supplementary information that can be used in helping to determine the orthology of the compared sequences. The assumption is that ortholo-gous gene copies are more likely to share the same genome positions and share the same gene neighbors. In this study we have defined positional homologs as those that also have homologous neighboring genes and we investigated the usefulness of this distinction for bacterial comparative genomics. We considered the identification of positionaly homologous gene pairs in bacterial genomes using protein and DNA sequence level alignments and found that the positional homologs had on average relatively lower rates of substitution at the DNA level (synonymous substitutions) than duplicate homologs in different genomic locations, regardless of the level of protein sequence divergence (measured with non-synonymous substitution rate). Since gene order conservation can indicate accuracy of orthology assignments, we also considered the effect of imposing certain alignment quality requirements on the sensitivity and specificity of identification of protein pairs by BLAST and FASTA when neighboring information is not available and in comparisons where gene order is not conserved. We found that the addition of a stringency filter based on the second best hits was an efficient way to remove dubious ortholog identifications in BLAST and FASTA analyses. Gene order conservation and DNA sequence homology are useful to consider in comparative genomic studies as they may indicate different orthology assignments than protein sequence homology alone. Libertas Academica 2007-01-14 /pmc/articles/PMC2674667/ /pubmed/19455203 Text en Copyright © 2006 The authors. http://creativecommons.org/licenses/by/3.0 This article is published under the Creative Commons Attribution By licence. For further information go to: http://creativecommons.org/licenses/by/3.0. (http://creativecommons.org/licenses/by/3.0) |
spellingShingle | Original Research Burgetz, Ingrid J. Shariff, Salimah Pang, Andy Tillier, Elisabeth R. M. Positional Homology in Bacterial Genomes |
title | Positional Homology in Bacterial Genomes |
title_full | Positional Homology in Bacterial Genomes |
title_fullStr | Positional Homology in Bacterial Genomes |
title_full_unstemmed | Positional Homology in Bacterial Genomes |
title_short | Positional Homology in Bacterial Genomes |
title_sort | positional homology in bacterial genomes |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2674667/ https://www.ncbi.nlm.nih.gov/pubmed/19455203 |
work_keys_str_mv | AT burgetzingridj positionalhomologyinbacterialgenomes AT shariffsalimah positionalhomologyinbacterialgenomes AT pangandy positionalhomologyinbacterialgenomes AT tillierelisabethrm positionalhomologyinbacterialgenomes |