Cargando…

MaLAdapt Reveals Novel Targets of Adaptive Introgression From Neanderthals and Denisovans in Worldwide Human Populations

Adaptive introgression (AI) facilitates local adaptation in a wide range of species. Many state-of-the-art methods detect AI with ad-hoc approaches that identify summary statistic outliers or intersect scans for positive selection with scans for introgressed genomic regions. Although widely used, ap...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Xinjun, Kim, Bernard, Singh, Armaan, Sankararaman, Sriram, Durvasula, Arun, Lohmueller, Kirk E
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9887621/
https://www.ncbi.nlm.nih.gov/pubmed/36617238
http://dx.doi.org/10.1093/molbev/msad001
_version_ 1784880378107920384
author Zhang, Xinjun
Kim, Bernard
Singh, Armaan
Sankararaman, Sriram
Durvasula, Arun
Lohmueller, Kirk E
author_facet Zhang, Xinjun
Kim, Bernard
Singh, Armaan
Sankararaman, Sriram
Durvasula, Arun
Lohmueller, Kirk E
author_sort Zhang, Xinjun
collection PubMed
description Adaptive introgression (AI) facilitates local adaptation in a wide range of species. Many state-of-the-art methods detect AI with ad-hoc approaches that identify summary statistic outliers or intersect scans for positive selection with scans for introgressed genomic regions. Although widely used, approaches intersecting outliers are vulnerable to a high false-negative rate as the power of different methods varies, especially for complex introgression events. Moreover, population genetic processes unrelated to AI, such as background selection or heterosis, may create similar genomic signals to AI, compromising the reliability of methods that rely on neutral null distributions. In recent years, machine learning (ML) methods have been increasingly applied to population genetic questions. Here, we present a ML-based method called MaLAdapt for identifying AI loci from genome-wide sequencing data. Using an Extra-Trees Classifier algorithm, our method combines information from a large number of biologically meaningful summary statistics to capture a powerful composite signature of AI across the genome. In contrast to existing methods, MaLAdapt is especially well-powered to detect AI with mild beneficial effects, including selection on standing archaic variation, and is robust to non-AI selective sweeps, heterosis from deleterious mutations, and demographic misspecification. Furthermore, MaLAdapt outperforms existing methods for detecting AI based on the analysis of simulated data and the validation of empirical signals through visual inspection of haplotype patterns. We apply MaLAdapt to the 1000 Genomes Project human genomic data and discover novel AI candidate regions in non-African populations, including genes that are enriched in functionally important biological pathways regulating metabolism and immune responses.
format Online
Article
Text
id pubmed-9887621
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-98876212023-02-01 MaLAdapt Reveals Novel Targets of Adaptive Introgression From Neanderthals and Denisovans in Worldwide Human Populations Zhang, Xinjun Kim, Bernard Singh, Armaan Sankararaman, Sriram Durvasula, Arun Lohmueller, Kirk E Mol Biol Evol Discoveries Adaptive introgression (AI) facilitates local adaptation in a wide range of species. Many state-of-the-art methods detect AI with ad-hoc approaches that identify summary statistic outliers or intersect scans for positive selection with scans for introgressed genomic regions. Although widely used, approaches intersecting outliers are vulnerable to a high false-negative rate as the power of different methods varies, especially for complex introgression events. Moreover, population genetic processes unrelated to AI, such as background selection or heterosis, may create similar genomic signals to AI, compromising the reliability of methods that rely on neutral null distributions. In recent years, machine learning (ML) methods have been increasingly applied to population genetic questions. Here, we present a ML-based method called MaLAdapt for identifying AI loci from genome-wide sequencing data. Using an Extra-Trees Classifier algorithm, our method combines information from a large number of biologically meaningful summary statistics to capture a powerful composite signature of AI across the genome. In contrast to existing methods, MaLAdapt is especially well-powered to detect AI with mild beneficial effects, including selection on standing archaic variation, and is robust to non-AI selective sweeps, heterosis from deleterious mutations, and demographic misspecification. Furthermore, MaLAdapt outperforms existing methods for detecting AI based on the analysis of simulated data and the validation of empirical signals through visual inspection of haplotype patterns. We apply MaLAdapt to the 1000 Genomes Project human genomic data and discover novel AI candidate regions in non-African populations, including genes that are enriched in functionally important biological pathways regulating metabolism and immune responses. Oxford University Press 2023-01-07 /pmc/articles/PMC9887621/ /pubmed/36617238 http://dx.doi.org/10.1093/molbev/msad001 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Discoveries
Zhang, Xinjun
Kim, Bernard
Singh, Armaan
Sankararaman, Sriram
Durvasula, Arun
Lohmueller, Kirk E
MaLAdapt Reveals Novel Targets of Adaptive Introgression From Neanderthals and Denisovans in Worldwide Human Populations
title MaLAdapt Reveals Novel Targets of Adaptive Introgression From Neanderthals and Denisovans in Worldwide Human Populations
title_full MaLAdapt Reveals Novel Targets of Adaptive Introgression From Neanderthals and Denisovans in Worldwide Human Populations
title_fullStr MaLAdapt Reveals Novel Targets of Adaptive Introgression From Neanderthals and Denisovans in Worldwide Human Populations
title_full_unstemmed MaLAdapt Reveals Novel Targets of Adaptive Introgression From Neanderthals and Denisovans in Worldwide Human Populations
title_short MaLAdapt Reveals Novel Targets of Adaptive Introgression From Neanderthals and Denisovans in Worldwide Human Populations
title_sort maladapt reveals novel targets of adaptive introgression from neanderthals and denisovans in worldwide human populations
topic Discoveries
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9887621/
https://www.ncbi.nlm.nih.gov/pubmed/36617238
http://dx.doi.org/10.1093/molbev/msad001
work_keys_str_mv AT zhangxinjun maladaptrevealsnoveltargetsofadaptiveintrogressionfromneanderthalsanddenisovansinworldwidehumanpopulations
AT kimbernard maladaptrevealsnoveltargetsofadaptiveintrogressionfromneanderthalsanddenisovansinworldwidehumanpopulations
AT singharmaan maladaptrevealsnoveltargetsofadaptiveintrogressionfromneanderthalsanddenisovansinworldwidehumanpopulations
AT sankararamansriram maladaptrevealsnoveltargetsofadaptiveintrogressionfromneanderthalsanddenisovansinworldwidehumanpopulations
AT durvasulaarun maladaptrevealsnoveltargetsofadaptiveintrogressionfromneanderthalsanddenisovansinworldwidehumanpopulations
AT lohmuellerkirke maladaptrevealsnoveltargetsofadaptiveintrogressionfromneanderthalsanddenisovansinworldwidehumanpopulations