Cargando…

Most Genomic Loci Misrepresent the Phylogeny of an Avian Radiation Because of Ancient Gene Flow

Phylogenetic trees based on genome-wide sequence data may not always represent the true evolutionary history for a variety of reasons. One process that can lead to incorrect reconstruction of species phylogenies is gene flow, especially if interspecific gene flow has affected large parts of the geno...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Dezhi, Rheindt, Frank E, She, Huishang, Cheng, Yalin, Song, Gang, Jia, Chenxi, Qu, Yanhua, Alström, Per, Lei, Fumin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8357342/
https://www.ncbi.nlm.nih.gov/pubmed/33787929
http://dx.doi.org/10.1093/sysbio/syab024
_version_ 1783737109698314240
author Zhang, Dezhi
Rheindt, Frank E
She, Huishang
Cheng, Yalin
Song, Gang
Jia, Chenxi
Qu, Yanhua
Alström, Per
Lei, Fumin
author_facet Zhang, Dezhi
Rheindt, Frank E
She, Huishang
Cheng, Yalin
Song, Gang
Jia, Chenxi
Qu, Yanhua
Alström, Per
Lei, Fumin
author_sort Zhang, Dezhi
collection PubMed
description Phylogenetic trees based on genome-wide sequence data may not always represent the true evolutionary history for a variety of reasons. One process that can lead to incorrect reconstruction of species phylogenies is gene flow, especially if interspecific gene flow has affected large parts of the genome. We investigated phylogenetic relationships within a clade comprising eight species of passerine birds (Phylloscopidae, Phylloscopus, leaf warblers) using one de novo genome assembly and 78 resequenced genomes. On the basis of hypothesis-exclusion trials based on D-statistics, phylogenetic network analysis, and demographic inference analysis, we identified ancient gene flow affecting large parts of the genome between one species and the ancestral lineage of a sister species pair. This ancient gene flow consistently caused erroneous reconstruction of the phylogeny when using large amounts of genome-wide sequence data. In contrast, the true relationships were captured when smaller parts of the genome were analyzed, showing that the “winner-takes-all democratic majority tree” is not necessarily the true species tree. Under this condition, smaller amounts of data may sometimes avoid the effects of gene flow due to stochastic sampling, as hidden reticulation histories are more likely to emerge from the use of larger data sets, especially whole-genome data sets. In addition, we also found that genomic regions affected by ancient gene flow generally exhibited higher genomic differentiation but a lower recombination rate and nucleotide diversity. Our study highlights the importance of considering reticulation in phylogenetic reconstructions in the genomic era.[Bifurcation; introgression; recombination; reticulation; Phylloscopus.]
format Online
Article
Text
id pubmed-8357342
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-83573422021-08-12 Most Genomic Loci Misrepresent the Phylogeny of an Avian Radiation Because of Ancient Gene Flow Zhang, Dezhi Rheindt, Frank E She, Huishang Cheng, Yalin Song, Gang Jia, Chenxi Qu, Yanhua Alström, Per Lei, Fumin Syst Biol Regular Articles Phylogenetic trees based on genome-wide sequence data may not always represent the true evolutionary history for a variety of reasons. One process that can lead to incorrect reconstruction of species phylogenies is gene flow, especially if interspecific gene flow has affected large parts of the genome. We investigated phylogenetic relationships within a clade comprising eight species of passerine birds (Phylloscopidae, Phylloscopus, leaf warblers) using one de novo genome assembly and 78 resequenced genomes. On the basis of hypothesis-exclusion trials based on D-statistics, phylogenetic network analysis, and demographic inference analysis, we identified ancient gene flow affecting large parts of the genome between one species and the ancestral lineage of a sister species pair. This ancient gene flow consistently caused erroneous reconstruction of the phylogeny when using large amounts of genome-wide sequence data. In contrast, the true relationships were captured when smaller parts of the genome were analyzed, showing that the “winner-takes-all democratic majority tree” is not necessarily the true species tree. Under this condition, smaller amounts of data may sometimes avoid the effects of gene flow due to stochastic sampling, as hidden reticulation histories are more likely to emerge from the use of larger data sets, especially whole-genome data sets. In addition, we also found that genomic regions affected by ancient gene flow generally exhibited higher genomic differentiation but a lower recombination rate and nucleotide diversity. Our study highlights the importance of considering reticulation in phylogenetic reconstructions in the genomic era.[Bifurcation; introgression; recombination; reticulation; Phylloscopus.] Oxford University Press 2021-03-31 /pmc/articles/PMC8357342/ /pubmed/33787929 http://dx.doi.org/10.1093/sysbio/syab024 Text en © The Author(s) 2021. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) ), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercialre-use, please contact journals.permissions@oup.com
spellingShingle Regular Articles
Zhang, Dezhi
Rheindt, Frank E
She, Huishang
Cheng, Yalin
Song, Gang
Jia, Chenxi
Qu, Yanhua
Alström, Per
Lei, Fumin
Most Genomic Loci Misrepresent the Phylogeny of an Avian Radiation Because of Ancient Gene Flow
title Most Genomic Loci Misrepresent the Phylogeny of an Avian Radiation Because of Ancient Gene Flow
title_full Most Genomic Loci Misrepresent the Phylogeny of an Avian Radiation Because of Ancient Gene Flow
title_fullStr Most Genomic Loci Misrepresent the Phylogeny of an Avian Radiation Because of Ancient Gene Flow
title_full_unstemmed Most Genomic Loci Misrepresent the Phylogeny of an Avian Radiation Because of Ancient Gene Flow
title_short Most Genomic Loci Misrepresent the Phylogeny of an Avian Radiation Because of Ancient Gene Flow
title_sort most genomic loci misrepresent the phylogeny of an avian radiation because of ancient gene flow
topic Regular Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8357342/
https://www.ncbi.nlm.nih.gov/pubmed/33787929
http://dx.doi.org/10.1093/sysbio/syab024
work_keys_str_mv AT zhangdezhi mostgenomiclocimisrepresentthephylogenyofanavianradiationbecauseofancientgeneflow
AT rheindtfranke mostgenomiclocimisrepresentthephylogenyofanavianradiationbecauseofancientgeneflow
AT shehuishang mostgenomiclocimisrepresentthephylogenyofanavianradiationbecauseofancientgeneflow
AT chengyalin mostgenomiclocimisrepresentthephylogenyofanavianradiationbecauseofancientgeneflow
AT songgang mostgenomiclocimisrepresentthephylogenyofanavianradiationbecauseofancientgeneflow
AT jiachenxi mostgenomiclocimisrepresentthephylogenyofanavianradiationbecauseofancientgeneflow
AT quyanhua mostgenomiclocimisrepresentthephylogenyofanavianradiationbecauseofancientgeneflow
AT alstromper mostgenomiclocimisrepresentthephylogenyofanavianradiationbecauseofancientgeneflow
AT leifumin mostgenomiclocimisrepresentthephylogenyofanavianradiationbecauseofancientgeneflow