Cargando…
Detecting selection using extended haplotype homozygosity (EHH)-based statistics in unphased or unpolarized data
Analysis of population genetic data often includes a search for genomic regions with signs of recent positive selection. One of such approaches involves the concept of extended haplotype homozygosity (EHH) and its associated statistics. These statistics typically require phased haplotypes, and some...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8765611/ https://www.ncbi.nlm.nih.gov/pubmed/35041674 http://dx.doi.org/10.1371/journal.pone.0262024 |
_version_ | 1784634352319070208 |
---|---|
author | Klassmann, Alexander Gautier, Mathieu |
author_facet | Klassmann, Alexander Gautier, Mathieu |
author_sort | Klassmann, Alexander |
collection | PubMed |
description | Analysis of population genetic data often includes a search for genomic regions with signs of recent positive selection. One of such approaches involves the concept of extended haplotype homozygosity (EHH) and its associated statistics. These statistics typically require phased haplotypes, and some of them necessitate polarized variants. Here, we unify and extend previously proposed modifications to loosen these requirements. We compare the modified versions with the original ones by measuring the false discovery rate in simulated whole-genome scans and by quantifying the overlap of inferred candidate regions in empirical data. We find that phasing information is indispensable for accurate estimation of within-population statistics (for all but very large samples) and of cross-population statistics for small samples. Ancestry information, in contrast, is of lesser importance for both types of statistic. Our publicly available R package rehh incorporates the modified statistics presented here. |
format | Online Article Text |
id | pubmed-8765611 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-87656112022-01-19 Detecting selection using extended haplotype homozygosity (EHH)-based statistics in unphased or unpolarized data Klassmann, Alexander Gautier, Mathieu PLoS One Research Article Analysis of population genetic data often includes a search for genomic regions with signs of recent positive selection. One of such approaches involves the concept of extended haplotype homozygosity (EHH) and its associated statistics. These statistics typically require phased haplotypes, and some of them necessitate polarized variants. Here, we unify and extend previously proposed modifications to loosen these requirements. We compare the modified versions with the original ones by measuring the false discovery rate in simulated whole-genome scans and by quantifying the overlap of inferred candidate regions in empirical data. We find that phasing information is indispensable for accurate estimation of within-population statistics (for all but very large samples) and of cross-population statistics for small samples. Ancestry information, in contrast, is of lesser importance for both types of statistic. Our publicly available R package rehh incorporates the modified statistics presented here. Public Library of Science 2022-01-18 /pmc/articles/PMC8765611/ /pubmed/35041674 http://dx.doi.org/10.1371/journal.pone.0262024 Text en © 2022 Klassmann, Gautier https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Klassmann, Alexander Gautier, Mathieu Detecting selection using extended haplotype homozygosity (EHH)-based statistics in unphased or unpolarized data |
title | Detecting selection using extended haplotype homozygosity (EHH)-based statistics in unphased or unpolarized data |
title_full | Detecting selection using extended haplotype homozygosity (EHH)-based statistics in unphased or unpolarized data |
title_fullStr | Detecting selection using extended haplotype homozygosity (EHH)-based statistics in unphased or unpolarized data |
title_full_unstemmed | Detecting selection using extended haplotype homozygosity (EHH)-based statistics in unphased or unpolarized data |
title_short | Detecting selection using extended haplotype homozygosity (EHH)-based statistics in unphased or unpolarized data |
title_sort | detecting selection using extended haplotype homozygosity (ehh)-based statistics in unphased or unpolarized data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8765611/ https://www.ncbi.nlm.nih.gov/pubmed/35041674 http://dx.doi.org/10.1371/journal.pone.0262024 |
work_keys_str_mv | AT klassmannalexander detectingselectionusingextendedhaplotypehomozygosityehhbasedstatisticsinunphasedorunpolarizeddata AT gautiermathieu detectingselectionusingextendedhaplotypehomozygosityehhbasedstatisticsinunphasedorunpolarizeddata |