Cargando…

Inferring the Direction of Introgression Using Genomic Sequence Data

Genomic data are informative about the history of species divergence and interspecific gene flow, including the direction, timing, and strength of gene flow. However, gene flow in opposite directions generates similar patterns in multilocus sequence data, such as reduced sequence divergence between...

Descripción completa

Detalles Bibliográficos
Autores principales: Thawornwattana, Yuttapong, Huang, Jun, Flouri, Tomáš, Mallet, James, Yang, Ziheng
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10439365/
https://www.ncbi.nlm.nih.gov/pubmed/37552932
http://dx.doi.org/10.1093/molbev/msad178
_version_ 1785092928901742592
author Thawornwattana, Yuttapong
Huang, Jun
Flouri, Tomáš
Mallet, James
Yang, Ziheng
author_facet Thawornwattana, Yuttapong
Huang, Jun
Flouri, Tomáš
Mallet, James
Yang, Ziheng
author_sort Thawornwattana, Yuttapong
collection PubMed
description Genomic data are informative about the history of species divergence and interspecific gene flow, including the direction, timing, and strength of gene flow. However, gene flow in opposite directions generates similar patterns in multilocus sequence data, such as reduced sequence divergence between the hybridizing species. As a result, inference of the direction of gene flow is challenging. Here, we investigate the information about the direction of gene flow present in genomic sequence data using likelihood-based methods under the multispecies-coalescent-with-introgression model. We analyze the case of two species, and use simulation to examine cases with three or four species. We find that it is easier to infer gene flow from a small population to a large one than in the opposite direction, and easier to infer inflow (gene flow from outgroup species to an ingroup species) than outflow (gene flow from an ingroup species to an outgroup species). It is also easier to infer gene flow if there is a longer time of separate evolution between the initial divergence and subsequent introgression. When introgression is assumed to occur in the wrong direction, the time of introgression tends to be correctly estimated and the Bayesian test of gene flow is often significant, while estimates of introgression probability can be even greater than the true probability. We analyze genomic sequences from Heliconius butterflies to demonstrate that typical genomic datasets are informative about the direction of interspecific gene flow, as well as its timing and strength.
format Online
Article
Text
id pubmed-10439365
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-104393652023-08-20 Inferring the Direction of Introgression Using Genomic Sequence Data Thawornwattana, Yuttapong Huang, Jun Flouri, Tomáš Mallet, James Yang, Ziheng Mol Biol Evol Methods Genomic data are informative about the history of species divergence and interspecific gene flow, including the direction, timing, and strength of gene flow. However, gene flow in opposite directions generates similar patterns in multilocus sequence data, such as reduced sequence divergence between the hybridizing species. As a result, inference of the direction of gene flow is challenging. Here, we investigate the information about the direction of gene flow present in genomic sequence data using likelihood-based methods under the multispecies-coalescent-with-introgression model. We analyze the case of two species, and use simulation to examine cases with three or four species. We find that it is easier to infer gene flow from a small population to a large one than in the opposite direction, and easier to infer inflow (gene flow from outgroup species to an ingroup species) than outflow (gene flow from an ingroup species to an outgroup species). It is also easier to infer gene flow if there is a longer time of separate evolution between the initial divergence and subsequent introgression. When introgression is assumed to occur in the wrong direction, the time of introgression tends to be correctly estimated and the Bayesian test of gene flow is often significant, while estimates of introgression probability can be even greater than the true probability. We analyze genomic sequences from Heliconius butterflies to demonstrate that typical genomic datasets are informative about the direction of interspecific gene flow, as well as its timing and strength. Oxford University Press 2023-08-08 /pmc/articles/PMC10439365/ /pubmed/37552932 http://dx.doi.org/10.1093/molbev/msad178 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methods
Thawornwattana, Yuttapong
Huang, Jun
Flouri, Tomáš
Mallet, James
Yang, Ziheng
Inferring the Direction of Introgression Using Genomic Sequence Data
title Inferring the Direction of Introgression Using Genomic Sequence Data
title_full Inferring the Direction of Introgression Using Genomic Sequence Data
title_fullStr Inferring the Direction of Introgression Using Genomic Sequence Data
title_full_unstemmed Inferring the Direction of Introgression Using Genomic Sequence Data
title_short Inferring the Direction of Introgression Using Genomic Sequence Data
title_sort inferring the direction of introgression using genomic sequence data
topic Methods
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10439365/
https://www.ncbi.nlm.nih.gov/pubmed/37552932
http://dx.doi.org/10.1093/molbev/msad178
work_keys_str_mv AT thawornwattanayuttapong inferringthedirectionofintrogressionusinggenomicsequencedata
AT huangjun inferringthedirectionofintrogressionusinggenomicsequencedata
AT flouritomas inferringthedirectionofintrogressionusinggenomicsequencedata
AT malletjames inferringthedirectionofintrogressionusinggenomicsequencedata
AT yangziheng inferringthedirectionofintrogressionusinggenomicsequencedata