Cargando…
Detecting recombination in evolving nucleotide sequences
BACKGROUND: Genetic recombination can produce heterogeneous phylogenetic histories within a set of homologous genes. These recombination events can be obscured by subsequent residue substitutions, which consequently complicate their detection. While there are many algorithms for the identification o...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1592127/ https://www.ncbi.nlm.nih.gov/pubmed/16978423 http://dx.doi.org/10.1186/1471-2105-7-412 |
_version_ | 1782130381945831424 |
---|---|
author | Chan, Cheong Xin Beiko, Robert G Ragan, Mark A |
author_facet | Chan, Cheong Xin Beiko, Robert G Ragan, Mark A |
author_sort | Chan, Cheong Xin |
collection | PubMed |
description | BACKGROUND: Genetic recombination can produce heterogeneous phylogenetic histories within a set of homologous genes. These recombination events can be obscured by subsequent residue substitutions, which consequently complicate their detection. While there are many algorithms for the identification of recombination events, little is known about the effects of subsequent substitutions on the accuracy of available recombination-detection approaches. RESULTS: We assessed the effect of subsequent substitutions on the detection of simulated recombination events within sets of four nucleotide sequences under a homogeneous evolutionary model. The amount of subsequent substitutions per site, prior evolutionary history of the sequences, and reciprocality or non-reciprocality of the recombination event all affected the accuracy of the recombination-detecting programs examined. Bayesian phylogenetic-based approaches showed high accuracy in detecting evidence of recombination event and in identifying recombination breakpoints. These approaches were less sensitive to parameter settings than other methods we tested, making them easier to apply to various data sets in a consistent manner. CONCLUSION: Post-recombination substitutions tend to diminish the predictive accuracy of recombination-detecting programs. The best method for detecting recombined regions is not necessarily the most accurate in identifying recombination breakpoints. For difficult detection problems involving highly divergent sequences or large data sets, different types of approach can be run in succession to increase efficiency, and can potentially yield better predictive accuracy than any single method used in isolation. |
format | Text |
id | pubmed-1592127 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-15921272006-10-05 Detecting recombination in evolving nucleotide sequences Chan, Cheong Xin Beiko, Robert G Ragan, Mark A BMC Bioinformatics Research Article BACKGROUND: Genetic recombination can produce heterogeneous phylogenetic histories within a set of homologous genes. These recombination events can be obscured by subsequent residue substitutions, which consequently complicate their detection. While there are many algorithms for the identification of recombination events, little is known about the effects of subsequent substitutions on the accuracy of available recombination-detection approaches. RESULTS: We assessed the effect of subsequent substitutions on the detection of simulated recombination events within sets of four nucleotide sequences under a homogeneous evolutionary model. The amount of subsequent substitutions per site, prior evolutionary history of the sequences, and reciprocality or non-reciprocality of the recombination event all affected the accuracy of the recombination-detecting programs examined. Bayesian phylogenetic-based approaches showed high accuracy in detecting evidence of recombination event and in identifying recombination breakpoints. These approaches were less sensitive to parameter settings than other methods we tested, making them easier to apply to various data sets in a consistent manner. CONCLUSION: Post-recombination substitutions tend to diminish the predictive accuracy of recombination-detecting programs. The best method for detecting recombined regions is not necessarily the most accurate in identifying recombination breakpoints. For difficult detection problems involving highly divergent sequences or large data sets, different types of approach can be run in succession to increase efficiency, and can potentially yield better predictive accuracy than any single method used in isolation. BioMed Central 2006-09-18 /pmc/articles/PMC1592127/ /pubmed/16978423 http://dx.doi.org/10.1186/1471-2105-7-412 Text en Copyright © 2006 Chan et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Chan, Cheong Xin Beiko, Robert G Ragan, Mark A Detecting recombination in evolving nucleotide sequences |
title | Detecting recombination in evolving nucleotide sequences |
title_full | Detecting recombination in evolving nucleotide sequences |
title_fullStr | Detecting recombination in evolving nucleotide sequences |
title_full_unstemmed | Detecting recombination in evolving nucleotide sequences |
title_short | Detecting recombination in evolving nucleotide sequences |
title_sort | detecting recombination in evolving nucleotide sequences |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1592127/ https://www.ncbi.nlm.nih.gov/pubmed/16978423 http://dx.doi.org/10.1186/1471-2105-7-412 |
work_keys_str_mv | AT chancheongxin detectingrecombinationinevolvingnucleotidesequences AT beikorobertg detectingrecombinationinevolvingnucleotidesequences AT raganmarka detectingrecombinationinevolvingnucleotidesequences |