Cargando…
Using the nucleotide substitution rate matrix to detect horizontal gene transfer
BACKGROUND: Horizontal gene transfer (HGT) has allowed bacteria to evolve many new capabilities. Because transferred genes perform many medically important functions, such as conferring antibiotic resistance, improved detection of horizontally transferred genes from sequence data would be an importa...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1657035/ https://www.ncbi.nlm.nih.gov/pubmed/17067382 http://dx.doi.org/10.1186/1471-2105-7-476 |
_version_ | 1782131025694949376 |
---|---|
author | Hamady, Micah Betterton, M D Knight, Rob |
author_facet | Hamady, Micah Betterton, M D Knight, Rob |
author_sort | Hamady, Micah |
collection | PubMed |
description | BACKGROUND: Horizontal gene transfer (HGT) has allowed bacteria to evolve many new capabilities. Because transferred genes perform many medically important functions, such as conferring antibiotic resistance, improved detection of horizontally transferred genes from sequence data would be an important advance. Existing sequence-based methods for detecting HGT focus on changes in nucleotide composition or on differences between gene and genome phylogenies; these methods have high error rates. RESULTS: First, we introduce a new class of methods for detecting HGT based on the changes in nucleotide substitution rates that occur when a gene is transferred to a new organism. Our new methods discriminate simulated HGT events with an error rate up to 10 times lower than does GC content. Use of models that are not time-reversible is crucial for detecting HGT. Second, we show that using combinations of multiple predictors of HGT offers substantial improvements over using any single predictor, yielding as much as a factor of 18 improvement in performance (a maximum reduction in error rate from 38% to about 3%). Multiple predictors were combined by using the random forests machine learning algorithm to identify optimal classifiers that separate HGT from non-HGT trees. CONCLUSION: The new class of HGT-detection methods introduced here combines advantages of phylogenetic and compositional HGT-detection techniques. These new techniques offer order-of-magnitude improvements over compositional methods because they are better able to discriminate HGT from non-HGT trees under a wide range of simulated conditions. We also found that combining multiple measures of HGT is essential for detecting a wide range of HGT events. These novel indicators of horizontal transfer will be widely useful in detecting HGT events linked to the evolution of important bacterial traits, such as antibiotic resistance and pathogenicity. |
format | Text |
id | pubmed-1657035 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-16570352006-11-22 Using the nucleotide substitution rate matrix to detect horizontal gene transfer Hamady, Micah Betterton, M D Knight, Rob BMC Bioinformatics Research Article BACKGROUND: Horizontal gene transfer (HGT) has allowed bacteria to evolve many new capabilities. Because transferred genes perform many medically important functions, such as conferring antibiotic resistance, improved detection of horizontally transferred genes from sequence data would be an important advance. Existing sequence-based methods for detecting HGT focus on changes in nucleotide composition or on differences between gene and genome phylogenies; these methods have high error rates. RESULTS: First, we introduce a new class of methods for detecting HGT based on the changes in nucleotide substitution rates that occur when a gene is transferred to a new organism. Our new methods discriminate simulated HGT events with an error rate up to 10 times lower than does GC content. Use of models that are not time-reversible is crucial for detecting HGT. Second, we show that using combinations of multiple predictors of HGT offers substantial improvements over using any single predictor, yielding as much as a factor of 18 improvement in performance (a maximum reduction in error rate from 38% to about 3%). Multiple predictors were combined by using the random forests machine learning algorithm to identify optimal classifiers that separate HGT from non-HGT trees. CONCLUSION: The new class of HGT-detection methods introduced here combines advantages of phylogenetic and compositional HGT-detection techniques. These new techniques offer order-of-magnitude improvements over compositional methods because they are better able to discriminate HGT from non-HGT trees under a wide range of simulated conditions. We also found that combining multiple measures of HGT is essential for detecting a wide range of HGT events. These novel indicators of horizontal transfer will be widely useful in detecting HGT events linked to the evolution of important bacterial traits, such as antibiotic resistance and pathogenicity. BioMed Central 2006-10-26 /pmc/articles/PMC1657035/ /pubmed/17067382 http://dx.doi.org/10.1186/1471-2105-7-476 Text en Copyright © 2006 Hamady et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Hamady, Micah Betterton, M D Knight, Rob Using the nucleotide substitution rate matrix to detect horizontal gene transfer |
title | Using the nucleotide substitution rate matrix to detect horizontal gene transfer |
title_full | Using the nucleotide substitution rate matrix to detect horizontal gene transfer |
title_fullStr | Using the nucleotide substitution rate matrix to detect horizontal gene transfer |
title_full_unstemmed | Using the nucleotide substitution rate matrix to detect horizontal gene transfer |
title_short | Using the nucleotide substitution rate matrix to detect horizontal gene transfer |
title_sort | using the nucleotide substitution rate matrix to detect horizontal gene transfer |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1657035/ https://www.ncbi.nlm.nih.gov/pubmed/17067382 http://dx.doi.org/10.1186/1471-2105-7-476 |
work_keys_str_mv | AT hamadymicah usingthenucleotidesubstitutionratematrixtodetecthorizontalgenetransfer AT bettertonmd usingthenucleotidesubstitutionratematrixtodetecthorizontalgenetransfer AT knightrob usingthenucleotidesubstitutionratematrixtodetecthorizontalgenetransfer |