Cargando…

Detecting selection using extended haplotype homozygosity (EHH)-based statistics in unphased or unpolarized data

Analysis of population genetic data often includes a search for genomic regions with signs of recent positive selection. One of such approaches involves the concept of extended haplotype homozygosity (EHH) and its associated statistics. These statistics typically require phased haplotypes, and some...

Descripción completa

Detalles Bibliográficos
Autores principales: Klassmann, Alexander, Gautier, Mathieu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8765611/
https://www.ncbi.nlm.nih.gov/pubmed/35041674
http://dx.doi.org/10.1371/journal.pone.0262024
_version_ 1784634352319070208
author Klassmann, Alexander
Gautier, Mathieu
author_facet Klassmann, Alexander
Gautier, Mathieu
author_sort Klassmann, Alexander
collection PubMed
description Analysis of population genetic data often includes a search for genomic regions with signs of recent positive selection. One of such approaches involves the concept of extended haplotype homozygosity (EHH) and its associated statistics. These statistics typically require phased haplotypes, and some of them necessitate polarized variants. Here, we unify and extend previously proposed modifications to loosen these requirements. We compare the modified versions with the original ones by measuring the false discovery rate in simulated whole-genome scans and by quantifying the overlap of inferred candidate regions in empirical data. We find that phasing information is indispensable for accurate estimation of within-population statistics (for all but very large samples) and of cross-population statistics for small samples. Ancestry information, in contrast, is of lesser importance for both types of statistic. Our publicly available R package rehh incorporates the modified statistics presented here.
format Online
Article
Text
id pubmed-8765611
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-87656112022-01-19 Detecting selection using extended haplotype homozygosity (EHH)-based statistics in unphased or unpolarized data Klassmann, Alexander Gautier, Mathieu PLoS One Research Article Analysis of population genetic data often includes a search for genomic regions with signs of recent positive selection. One of such approaches involves the concept of extended haplotype homozygosity (EHH) and its associated statistics. These statistics typically require phased haplotypes, and some of them necessitate polarized variants. Here, we unify and extend previously proposed modifications to loosen these requirements. We compare the modified versions with the original ones by measuring the false discovery rate in simulated whole-genome scans and by quantifying the overlap of inferred candidate regions in empirical data. We find that phasing information is indispensable for accurate estimation of within-population statistics (for all but very large samples) and of cross-population statistics for small samples. Ancestry information, in contrast, is of lesser importance for both types of statistic. Our publicly available R package rehh incorporates the modified statistics presented here. Public Library of Science 2022-01-18 /pmc/articles/PMC8765611/ /pubmed/35041674 http://dx.doi.org/10.1371/journal.pone.0262024 Text en © 2022 Klassmann, Gautier https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Klassmann, Alexander
Gautier, Mathieu
Detecting selection using extended haplotype homozygosity (EHH)-based statistics in unphased or unpolarized data
title Detecting selection using extended haplotype homozygosity (EHH)-based statistics in unphased or unpolarized data
title_full Detecting selection using extended haplotype homozygosity (EHH)-based statistics in unphased or unpolarized data
title_fullStr Detecting selection using extended haplotype homozygosity (EHH)-based statistics in unphased or unpolarized data
title_full_unstemmed Detecting selection using extended haplotype homozygosity (EHH)-based statistics in unphased or unpolarized data
title_short Detecting selection using extended haplotype homozygosity (EHH)-based statistics in unphased or unpolarized data
title_sort detecting selection using extended haplotype homozygosity (ehh)-based statistics in unphased or unpolarized data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8765611/
https://www.ncbi.nlm.nih.gov/pubmed/35041674
http://dx.doi.org/10.1371/journal.pone.0262024
work_keys_str_mv AT klassmannalexander detectingselectionusingextendedhaplotypehomozygosityehhbasedstatisticsinunphasedorunpolarizeddata
AT gautiermathieu detectingselectionusingextendedhaplotypehomozygosityehhbasedstatisticsinunphasedorunpolarizeddata