Cargando…

Genomic Data Quality Impacts Automated Detection of Lateral Gene Transfer in Fungi

Lateral gene transfer (LGT, also known as horizontal gene transfer), an atypical mechanism of transferring genes between species, has almost become the default explanation for genes that display an unexpected composition or phylogeny. Numerous methods of detecting LGT events all rely on two fundamen...

Descripción completa

Detalles Bibliográficos
Autores principales: Dupont, Pierre-Yves, Cox, Murray P.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5386878/
https://www.ncbi.nlm.nih.gov/pubmed/28235827
http://dx.doi.org/10.1534/g3.116.038448
_version_ 1782520845014401024
author Dupont, Pierre-Yves
Cox, Murray P.
author_facet Dupont, Pierre-Yves
Cox, Murray P.
author_sort Dupont, Pierre-Yves
collection PubMed
description Lateral gene transfer (LGT, also known as horizontal gene transfer), an atypical mechanism of transferring genes between species, has almost become the default explanation for genes that display an unexpected composition or phylogeny. Numerous methods of detecting LGT events all rely on two fundamental strategies: primary structure composition or gene tree/species tree comparisons. Discouragingly, the results of these different approaches rarely coincide. With the wealth of genome data now available, detection of laterally transferred genes is increasingly being attempted in large uncurated eukaryotic datasets. However, detection methods depend greatly on the quality of the underlying genomic data, which are typically complex for eukaryotes. Furthermore, given the automated nature of genomic data collection, it is typically impractical to manually verify all protein or gene models, orthology predictions, and multiple sequence alignments, requiring researchers to accept a substantial margin of error in their datasets. Using a test case comprising plant-associated genomes across the fungal kingdom, this study reveals that composition- and phylogeny-based methods have little statistical power to detect laterally transferred genes. In particular, phylogenetic methods reveal extreme levels of topological variation in fungal gene trees, the vast majority of which show departures from the canonical species tree. Therefore, it is inherently challenging to detect LGT events in typical eukaryotic genomes. This finding is in striking contrast to the large number of claims for laterally transferred genes in eukaryotic species that routinely appear in the literature, and questions how many of these proposed examples are statistically well supported.
format Online
Article
Text
id pubmed-5386878
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-53868782017-04-13 Genomic Data Quality Impacts Automated Detection of Lateral Gene Transfer in Fungi Dupont, Pierre-Yves Cox, Murray P. G3 (Bethesda) Investigations Lateral gene transfer (LGT, also known as horizontal gene transfer), an atypical mechanism of transferring genes between species, has almost become the default explanation for genes that display an unexpected composition or phylogeny. Numerous methods of detecting LGT events all rely on two fundamental strategies: primary structure composition or gene tree/species tree comparisons. Discouragingly, the results of these different approaches rarely coincide. With the wealth of genome data now available, detection of laterally transferred genes is increasingly being attempted in large uncurated eukaryotic datasets. However, detection methods depend greatly on the quality of the underlying genomic data, which are typically complex for eukaryotes. Furthermore, given the automated nature of genomic data collection, it is typically impractical to manually verify all protein or gene models, orthology predictions, and multiple sequence alignments, requiring researchers to accept a substantial margin of error in their datasets. Using a test case comprising plant-associated genomes across the fungal kingdom, this study reveals that composition- and phylogeny-based methods have little statistical power to detect laterally transferred genes. In particular, phylogenetic methods reveal extreme levels of topological variation in fungal gene trees, the vast majority of which show departures from the canonical species tree. Therefore, it is inherently challenging to detect LGT events in typical eukaryotic genomes. This finding is in striking contrast to the large number of claims for laterally transferred genes in eukaryotic species that routinely appear in the literature, and questions how many of these proposed examples are statistically well supported. Genetics Society of America 2017-02-23 /pmc/articles/PMC5386878/ /pubmed/28235827 http://dx.doi.org/10.1534/g3.116.038448 Text en Copyright © 2017 Dupont and Cox http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigations
Dupont, Pierre-Yves
Cox, Murray P.
Genomic Data Quality Impacts Automated Detection of Lateral Gene Transfer in Fungi
title Genomic Data Quality Impacts Automated Detection of Lateral Gene Transfer in Fungi
title_full Genomic Data Quality Impacts Automated Detection of Lateral Gene Transfer in Fungi
title_fullStr Genomic Data Quality Impacts Automated Detection of Lateral Gene Transfer in Fungi
title_full_unstemmed Genomic Data Quality Impacts Automated Detection of Lateral Gene Transfer in Fungi
title_short Genomic Data Quality Impacts Automated Detection of Lateral Gene Transfer in Fungi
title_sort genomic data quality impacts automated detection of lateral gene transfer in fungi
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5386878/
https://www.ncbi.nlm.nih.gov/pubmed/28235827
http://dx.doi.org/10.1534/g3.116.038448
work_keys_str_mv AT dupontpierreyves genomicdataqualityimpactsautomateddetectionoflateralgenetransferinfungi
AT coxmurrayp genomicdataqualityimpactsautomateddetectionoflateralgenetransferinfungi