Cargando…

Positional Homology in Bacterial Genomes

In comparative genomic studies, syntenic groups of homologous sequence in the same order have been used as supplementary information that can be used in helping to determine the orthology of the compared sequences. The assumption is that ortholo-gous gene copies are more likely to share the same gen...

Descripción completa

Detalles Bibliográficos
Autores principales: Burgetz, Ingrid J., Shariff, Salimah, Pang, Andy, Tillier, Elisabeth R. M.
Formato: Texto
Lenguaje:English
Publicado: Libertas Academica 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2674667/
https://www.ncbi.nlm.nih.gov/pubmed/19455203
_version_ 1782166662270681088
author Burgetz, Ingrid J.
Shariff, Salimah
Pang, Andy
Tillier, Elisabeth R. M.
author_facet Burgetz, Ingrid J.
Shariff, Salimah
Pang, Andy
Tillier, Elisabeth R. M.
author_sort Burgetz, Ingrid J.
collection PubMed
description In comparative genomic studies, syntenic groups of homologous sequence in the same order have been used as supplementary information that can be used in helping to determine the orthology of the compared sequences. The assumption is that ortholo-gous gene copies are more likely to share the same genome positions and share the same gene neighbors. In this study we have defined positional homologs as those that also have homologous neighboring genes and we investigated the usefulness of this distinction for bacterial comparative genomics. We considered the identification of positionaly homologous gene pairs in bacterial genomes using protein and DNA sequence level alignments and found that the positional homologs had on average relatively lower rates of substitution at the DNA level (synonymous substitutions) than duplicate homologs in different genomic locations, regardless of the level of protein sequence divergence (measured with non-synonymous substitution rate). Since gene order conservation can indicate accuracy of orthology assignments, we also considered the effect of imposing certain alignment quality requirements on the sensitivity and specificity of identification of protein pairs by BLAST and FASTA when neighboring information is not available and in comparisons where gene order is not conserved. We found that the addition of a stringency filter based on the second best hits was an efficient way to remove dubious ortholog identifications in BLAST and FASTA analyses. Gene order conservation and DNA sequence homology are useful to consider in comparative genomic studies as they may indicate different orthology assignments than protein sequence homology alone.
format Text
id pubmed-2674667
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-26746672009-05-19 Positional Homology in Bacterial Genomes Burgetz, Ingrid J. Shariff, Salimah Pang, Andy Tillier, Elisabeth R. M. Evol Bioinform Online Original Research In comparative genomic studies, syntenic groups of homologous sequence in the same order have been used as supplementary information that can be used in helping to determine the orthology of the compared sequences. The assumption is that ortholo-gous gene copies are more likely to share the same genome positions and share the same gene neighbors. In this study we have defined positional homologs as those that also have homologous neighboring genes and we investigated the usefulness of this distinction for bacterial comparative genomics. We considered the identification of positionaly homologous gene pairs in bacterial genomes using protein and DNA sequence level alignments and found that the positional homologs had on average relatively lower rates of substitution at the DNA level (synonymous substitutions) than duplicate homologs in different genomic locations, regardless of the level of protein sequence divergence (measured with non-synonymous substitution rate). Since gene order conservation can indicate accuracy of orthology assignments, we also considered the effect of imposing certain alignment quality requirements on the sensitivity and specificity of identification of protein pairs by BLAST and FASTA when neighboring information is not available and in comparisons where gene order is not conserved. We found that the addition of a stringency filter based on the second best hits was an efficient way to remove dubious ortholog identifications in BLAST and FASTA analyses. Gene order conservation and DNA sequence homology are useful to consider in comparative genomic studies as they may indicate different orthology assignments than protein sequence homology alone. Libertas Academica 2007-01-14 /pmc/articles/PMC2674667/ /pubmed/19455203 Text en Copyright © 2006 The authors. http://creativecommons.org/licenses/by/3.0 This article is published under the Creative Commons Attribution By licence. For further information go to: http://creativecommons.org/licenses/by/3.0. (http://creativecommons.org/licenses/by/3.0)
spellingShingle Original Research
Burgetz, Ingrid J.
Shariff, Salimah
Pang, Andy
Tillier, Elisabeth R. M.
Positional Homology in Bacterial Genomes
title Positional Homology in Bacterial Genomes
title_full Positional Homology in Bacterial Genomes
title_fullStr Positional Homology in Bacterial Genomes
title_full_unstemmed Positional Homology in Bacterial Genomes
title_short Positional Homology in Bacterial Genomes
title_sort positional homology in bacterial genomes
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2674667/
https://www.ncbi.nlm.nih.gov/pubmed/19455203
work_keys_str_mv AT burgetzingridj positionalhomologyinbacterialgenomes
AT shariffsalimah positionalhomologyinbacterialgenomes
AT pangandy positionalhomologyinbacterialgenomes
AT tillierelisabethrm positionalhomologyinbacterialgenomes