Cargando…

Detecting recombination in evolving nucleotide sequences

BACKGROUND: Genetic recombination can produce heterogeneous phylogenetic histories within a set of homologous genes. These recombination events can be obscured by subsequent residue substitutions, which consequently complicate their detection. While there are many algorithms for the identification o...

Descripción completa

Detalles Bibliográficos
Autores principales: Chan, Cheong Xin, Beiko, Robert G, Ragan, Mark A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1592127/
https://www.ncbi.nlm.nih.gov/pubmed/16978423
http://dx.doi.org/10.1186/1471-2105-7-412
_version_ 1782130381945831424
author Chan, Cheong Xin
Beiko, Robert G
Ragan, Mark A
author_facet Chan, Cheong Xin
Beiko, Robert G
Ragan, Mark A
author_sort Chan, Cheong Xin
collection PubMed
description BACKGROUND: Genetic recombination can produce heterogeneous phylogenetic histories within a set of homologous genes. These recombination events can be obscured by subsequent residue substitutions, which consequently complicate their detection. While there are many algorithms for the identification of recombination events, little is known about the effects of subsequent substitutions on the accuracy of available recombination-detection approaches. RESULTS: We assessed the effect of subsequent substitutions on the detection of simulated recombination events within sets of four nucleotide sequences under a homogeneous evolutionary model. The amount of subsequent substitutions per site, prior evolutionary history of the sequences, and reciprocality or non-reciprocality of the recombination event all affected the accuracy of the recombination-detecting programs examined. Bayesian phylogenetic-based approaches showed high accuracy in detecting evidence of recombination event and in identifying recombination breakpoints. These approaches were less sensitive to parameter settings than other methods we tested, making them easier to apply to various data sets in a consistent manner. CONCLUSION: Post-recombination substitutions tend to diminish the predictive accuracy of recombination-detecting programs. The best method for detecting recombined regions is not necessarily the most accurate in identifying recombination breakpoints. For difficult detection problems involving highly divergent sequences or large data sets, different types of approach can be run in succession to increase efficiency, and can potentially yield better predictive accuracy than any single method used in isolation.
format Text
id pubmed-1592127
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-15921272006-10-05 Detecting recombination in evolving nucleotide sequences Chan, Cheong Xin Beiko, Robert G Ragan, Mark A BMC Bioinformatics Research Article BACKGROUND: Genetic recombination can produce heterogeneous phylogenetic histories within a set of homologous genes. These recombination events can be obscured by subsequent residue substitutions, which consequently complicate their detection. While there are many algorithms for the identification of recombination events, little is known about the effects of subsequent substitutions on the accuracy of available recombination-detection approaches. RESULTS: We assessed the effect of subsequent substitutions on the detection of simulated recombination events within sets of four nucleotide sequences under a homogeneous evolutionary model. The amount of subsequent substitutions per site, prior evolutionary history of the sequences, and reciprocality or non-reciprocality of the recombination event all affected the accuracy of the recombination-detecting programs examined. Bayesian phylogenetic-based approaches showed high accuracy in detecting evidence of recombination event and in identifying recombination breakpoints. These approaches were less sensitive to parameter settings than other methods we tested, making them easier to apply to various data sets in a consistent manner. CONCLUSION: Post-recombination substitutions tend to diminish the predictive accuracy of recombination-detecting programs. The best method for detecting recombined regions is not necessarily the most accurate in identifying recombination breakpoints. For difficult detection problems involving highly divergent sequences or large data sets, different types of approach can be run in succession to increase efficiency, and can potentially yield better predictive accuracy than any single method used in isolation. BioMed Central 2006-09-18 /pmc/articles/PMC1592127/ /pubmed/16978423 http://dx.doi.org/10.1186/1471-2105-7-412 Text en Copyright © 2006 Chan et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Chan, Cheong Xin
Beiko, Robert G
Ragan, Mark A
Detecting recombination in evolving nucleotide sequences
title Detecting recombination in evolving nucleotide sequences
title_full Detecting recombination in evolving nucleotide sequences
title_fullStr Detecting recombination in evolving nucleotide sequences
title_full_unstemmed Detecting recombination in evolving nucleotide sequences
title_short Detecting recombination in evolving nucleotide sequences
title_sort detecting recombination in evolving nucleotide sequences
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1592127/
https://www.ncbi.nlm.nih.gov/pubmed/16978423
http://dx.doi.org/10.1186/1471-2105-7-412
work_keys_str_mv AT chancheongxin detectingrecombinationinevolvingnucleotidesequences
AT beikorobertg detectingrecombinationinevolvingnucleotidesequences
AT raganmarka detectingrecombinationinevolvingnucleotidesequences