Cargando…

Using the nucleotide substitution rate matrix to detect horizontal gene transfer

BACKGROUND: Horizontal gene transfer (HGT) has allowed bacteria to evolve many new capabilities. Because transferred genes perform many medically important functions, such as conferring antibiotic resistance, improved detection of horizontally transferred genes from sequence data would be an importa...

Descripción completa

Detalles Bibliográficos
Autores principales: Hamady, Micah, Betterton, M D, Knight, Rob
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1657035/
https://www.ncbi.nlm.nih.gov/pubmed/17067382
http://dx.doi.org/10.1186/1471-2105-7-476
_version_ 1782131025694949376
author Hamady, Micah
Betterton, M D
Knight, Rob
author_facet Hamady, Micah
Betterton, M D
Knight, Rob
author_sort Hamady, Micah
collection PubMed
description BACKGROUND: Horizontal gene transfer (HGT) has allowed bacteria to evolve many new capabilities. Because transferred genes perform many medically important functions, such as conferring antibiotic resistance, improved detection of horizontally transferred genes from sequence data would be an important advance. Existing sequence-based methods for detecting HGT focus on changes in nucleotide composition or on differences between gene and genome phylogenies; these methods have high error rates. RESULTS: First, we introduce a new class of methods for detecting HGT based on the changes in nucleotide substitution rates that occur when a gene is transferred to a new organism. Our new methods discriminate simulated HGT events with an error rate up to 10 times lower than does GC content. Use of models that are not time-reversible is crucial for detecting HGT. Second, we show that using combinations of multiple predictors of HGT offers substantial improvements over using any single predictor, yielding as much as a factor of 18 improvement in performance (a maximum reduction in error rate from 38% to about 3%). Multiple predictors were combined by using the random forests machine learning algorithm to identify optimal classifiers that separate HGT from non-HGT trees. CONCLUSION: The new class of HGT-detection methods introduced here combines advantages of phylogenetic and compositional HGT-detection techniques. These new techniques offer order-of-magnitude improvements over compositional methods because they are better able to discriminate HGT from non-HGT trees under a wide range of simulated conditions. We also found that combining multiple measures of HGT is essential for detecting a wide range of HGT events. These novel indicators of horizontal transfer will be widely useful in detecting HGT events linked to the evolution of important bacterial traits, such as antibiotic resistance and pathogenicity.
format Text
id pubmed-1657035
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-16570352006-11-22 Using the nucleotide substitution rate matrix to detect horizontal gene transfer Hamady, Micah Betterton, M D Knight, Rob BMC Bioinformatics Research Article BACKGROUND: Horizontal gene transfer (HGT) has allowed bacteria to evolve many new capabilities. Because transferred genes perform many medically important functions, such as conferring antibiotic resistance, improved detection of horizontally transferred genes from sequence data would be an important advance. Existing sequence-based methods for detecting HGT focus on changes in nucleotide composition or on differences between gene and genome phylogenies; these methods have high error rates. RESULTS: First, we introduce a new class of methods for detecting HGT based on the changes in nucleotide substitution rates that occur when a gene is transferred to a new organism. Our new methods discriminate simulated HGT events with an error rate up to 10 times lower than does GC content. Use of models that are not time-reversible is crucial for detecting HGT. Second, we show that using combinations of multiple predictors of HGT offers substantial improvements over using any single predictor, yielding as much as a factor of 18 improvement in performance (a maximum reduction in error rate from 38% to about 3%). Multiple predictors were combined by using the random forests machine learning algorithm to identify optimal classifiers that separate HGT from non-HGT trees. CONCLUSION: The new class of HGT-detection methods introduced here combines advantages of phylogenetic and compositional HGT-detection techniques. These new techniques offer order-of-magnitude improvements over compositional methods because they are better able to discriminate HGT from non-HGT trees under a wide range of simulated conditions. We also found that combining multiple measures of HGT is essential for detecting a wide range of HGT events. These novel indicators of horizontal transfer will be widely useful in detecting HGT events linked to the evolution of important bacterial traits, such as antibiotic resistance and pathogenicity. BioMed Central 2006-10-26 /pmc/articles/PMC1657035/ /pubmed/17067382 http://dx.doi.org/10.1186/1471-2105-7-476 Text en Copyright © 2006 Hamady et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Hamady, Micah
Betterton, M D
Knight, Rob
Using the nucleotide substitution rate matrix to detect horizontal gene transfer
title Using the nucleotide substitution rate matrix to detect horizontal gene transfer
title_full Using the nucleotide substitution rate matrix to detect horizontal gene transfer
title_fullStr Using the nucleotide substitution rate matrix to detect horizontal gene transfer
title_full_unstemmed Using the nucleotide substitution rate matrix to detect horizontal gene transfer
title_short Using the nucleotide substitution rate matrix to detect horizontal gene transfer
title_sort using the nucleotide substitution rate matrix to detect horizontal gene transfer
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1657035/
https://www.ncbi.nlm.nih.gov/pubmed/17067382
http://dx.doi.org/10.1186/1471-2105-7-476
work_keys_str_mv AT hamadymicah usingthenucleotidesubstitutionratematrixtodetecthorizontalgenetransfer
AT bettertonmd usingthenucleotidesubstitutionratematrixtodetecthorizontalgenetransfer
AT knightrob usingthenucleotidesubstitutionratematrixtodetecthorizontalgenetransfer